JavaScript RegExp教程

RegExp（正则表达式）是一种用于匹配文本的强大工具。在JavaScript中，可以使用RegExp对象来执行诸如搜索、替换等操作。

正则表达式的基础语法

正则表达式由简单的字符、元字符和特殊字符组成。

简单的字符：例如字母和数字，它们按字面意义匹配文本。例如，正则表达式 /hello/ 在搜索文本中包含 hello 的所有实例。
元字符：元字符具有特殊含义，例如字符集、边界和重复。例如，元字符 . 可以匹配任何字符，而元字符 * 可以匹配零个或多个先前字符。
特殊字符：有些字符表示特殊的用途，例如转义字符 \，它可以用于转义其他字符。

为了使用正则表达式，需要创建一个RegExp对象或在正则表达式字面量中使用斜杠（/）。

示例代码：

// 创建RegExp对象
let re = new RegExp('hello');
let result = re.test('hello world');
console.log(result); // true

// 使用正则表达式字面量
let re2 = /world/;
let result2 = re2.test('hello world');
console.log(result2); // true

正则表达式的匹配模式

正则表达式可以使用匹配模式来确定匹配的方式。以下是常见的匹配模式：

i：表示不区分大小写
g：表示全局匹配
m：表示多行匹配

可以将这些模式与正则表达式字面量一起使用，也可以通过RegExp对象的属性进行设置。

示例代码：

// i模式示例
let re = /hello/i;
let result = re.test('Hello World');
console.log(result); // true

// g模式示例
let re2 = /hello/g;
let str2 = 'hello hello hello';
let result2 = str2.match(re2);
console.log(result2); // ["hello", "hello", "hello"]

// m模式示例
let re3 = /^hello/im;
let str3 = 'hello\nworld\nhello';
let result3 = str3.match(re3);
console.log(result3); // ["hello", "hello"]

正则表达式的元字符

元字符是构成正则表达式的关键部分。以下是常用的元字符：

.：匹配任意字符
[ ]：字符集，匹配范围内的任意一个字符
[^ ]：否定的字符集，匹配不在范围内的任何字符
*：匹配零个或多个先前的字符
+：匹配一个或多个先前字符
?：匹配前导字符的零个或一个实例
( )：捕获分组，用于捕获正则表达式的一部分以供后用
{ }：匹配指定次数的前导字符

示例代码：

// .元字符示例
let re = /.o/;
console.log(re.test('hello')); // true
console.log(re.test('world')); // false

// [ ]字符集示例
let re2 = /[aeiou]/;
console.log(re2.test('hello')); // true
console.log(re2.test('world')); // false

// [^ ]否定字符集示例
let re3 = /[^aeiou]/;
console.log(re3.test('hello')); // false
console.log(re3.test('world')); // true

// *元字符示例
let re4 = /a*/;
console.log(re4.test('hello')); // true
console.log(re4.test('world')); // true
console.log(re4.test('abc')); // false

// +元字符示例
let re5 = /a+/;
console.log(re5.test('hello')); // false
console.log(re5.test('world')); // false
console.log(re5.test('aaa')); // true

// ?元字符示例
let re6 = /t?p/;
console.log(re6.test('top')); // true
console.log(re6.test('tip')); // true
console.log(re6.test('sip')); // false

// ( )捕获分组示例
let re7 = /(hello) (world)/;
let str7 = 'hello world';
let result7 = str7.match(re7);
console.log(result7); // ["hello world", "hello", "world"]

// { }匹配指定次数示例
let re8 = /a{3}/;
console.log(re8.test('aaa')); // true
console.log(re8.test('aa')); // false
console.log(re8.test('aaaa')); // true

正则表达式的特殊字符

正则表达式包括一些特殊字符，它们用于执行不同的功能。以下是常见的特殊字符：

\：转义字符，用于转义其他字符
^：匹配文本的开头
$：匹配文本的结尾
\b：匹配单词边界
\d：匹配数字字符
\D：匹配非数字字符
\s：匹配空格字符
\S：匹配非空格字符
\w：匹配单词字符
\W：匹配非单词字符

示例代码：

// \转义字符示例
let re = /\./;
console.log(re.test('hello.world')); // true
console.log(re.test('helloworld')); // false

// ^匹配文本开头示例
let re2 = /^hello/;
console.log(re2.test('hello world')); // true
console.log(re2.test('world hello')); // false

//  $匹配文本结尾示例 let re3 = /world$ /;
console.log(re3.test('hello world')); // true
console.log(re3.test('hello earth')); // false

// \b匹配单词边界示例
let re4 = /\bhello\b/;
console.log(re4.test('hello world')); // true
console.log(re4.test('helloearth')); // false

// \d匹配数字字符示例
let re5 = /\d+/;
console.log(re5.test('hello 123')); // true
console.log(re5.test('hello earth')); // false

// \D匹配非数字字符示例
let re6 = /\D+/;
console.log(re6.test('hello 123')); // true
console.log(re6.test('helloearth')); // true

// \s匹配空格字符示例
let re7 = /\s+/;
console.log(re7.test('hello world')); // true
console.log(re7.test('helloearth')); // false

// \S匹配非空格字符示例
let re8 = /\S+/;
console.log(re8.test('hello world')); // true
console.log(re8.test('hello  earth')); // true

// \w匹配单词字符示例
let re9 = /\w+/;
console.log(re9.test('hello world')); // true
console.log(re9.test('!@# $%')); // false // \W匹配非单词字符示例 let re10 = /\W+/; console.log(re10.test('hello world')); // false console.log(re10.test('!@#$ %')); // true

正则表达式的高级技巧

正则表达式是一个广泛使用的工具，可用于处理文本的各种复杂问题。以下是一些高级技巧：

贪婪与非贪婪匹配：默认情况下，正则表达式会尽可能多地匹配文本。可以使用 ? 将其改为非贪婪模式。
反向引用：使用捕获分组，可以在正则表达式中引用先前捕获的文本。可以使用反斜杠和分组编号（例如 \1）来完成。
前后查找：可以在正则表达式中使用前后查找，以匹配前面或后面具有特定字符或模式的文本。

示例代码：

// 贪婪与非贪婪匹配示例
let re = /<.+?>/;
let str = '<b>hello</b> world';
let result = str.match(re);
console.log(result); // ["<b>", "</b>"]

let re2 = /<.+>/;
let str2 = '<b>hello</b> world';
let result2 = str2.match(re2);
console.log(result2); // ["<b>hello</b>"]

// 反向引用示例
let re3 = /([a-z]+) \1/;
let str3 = 'hello hello';
let result3 = str3.match(re3);
console.log(result3); // ["hello hello", "hello"]

// 前后查找示例
let re4 = /(?<=hello) world/;
let str4 = 'hello world';
let result4 = str4.match(re4);
console.log(result4); // [" world"]

let re5 = /world (?=hello)/;
let str5 = 'world hello';
let result5 = str5.match(re5);
console.log(result5); // ["world "]