正则表达式替换html标签之外的文本

我有这个HTML：

"This is simple html text simple simple text text text"

我只需要匹配任何HTML标记之外的单词。我的意思是，如果我想匹配“简单”和“文本”，则只能从“这是简单的html文本”和最后一部分“文本”中得到结果-结果将是“简单” 1匹配，“文本” 2火柴。有人可以帮我吗？我正在使用jQuery。

var pattern = new RegExp("(\\b" + value + "\\b)", 'gi');

if (pattern.test(text)) {

text = text.replace(pattern, "$1");

}

value 是我要匹配的单词（在这种情况下为“简单”）

text 是 "This is simple html text simple simple text text text"

我需要用来包装所有选定的单词（在此示例中为“简单”）。但是我只想包装任何 HTML标记之外的词。这个例子的结果应该是

This is simple html text simple simple text text text

我不想替换里面的任何文本

simple simple text text

它应与更换前的相同。

江户川乱折腾

浏览 991回答 2

2回答

白衣染霜花

好的，尝试使用此正则表达式：(text|simple)(?![^<]*>|[^<>]*</)该示例在regex101上工作。分解：(         # Open capture group  text    # Match 'text'|         # Or  simple  # Match 'simple')         # End capture group(?!       # Negative lookahead start (will cause match to fail if contents match)  [^<]*   # Any number of non-'<' characters  >       # A > character|         # Or  [^<>]*  # Any number of non-'<' and non-'>' characters  </      # The characters < and /)         # End negative lookahead.否定的超前查询将阻止html标签之间的text或simple。

随时随地看视频慕课网APP