Regular expressions preview

2025-08-21

Applies to: SQL Server 2025 (17.x) Preview Azure SQL Database Azure SQL Managed Instance SQL database in Microsoft Fabric Preview

This article introduces regular expressions for SQL Server.

Note

As a preview feature, the technology presented in this article is subject to Supplemental Terms of Use for Microsoft Azure Previews.

A regular expression, or regex, is a sequence of characters that defines a search pattern for text. Regex is commonly used for a variety of tasks including pattern matching, data validation, data transformation, and querying. It offers a flexible and an efficient way to search, manipulate, and handle complex data operations.

Note

Regular expressions are available in Azure SQL Managed Instance configured with the Always-up-to-date update policy.

This implementation of regular expression is based on the RE2 regular expression library. For more information, visit RE2 Regular Expression Syntax.

Supported regex functions:

Function	Description
REGEXP_LIKE	Returns a Boolean value that indicates whether the text input matches the regex pattern.
REGEXP_REPLACE	Returns a modified source string replaced by a replacement string, where occurrence of the regex pattern found.
REGEXP_SUBSTR	Extracts parts of a string based on a regular expression pattern. Returns Nth occurrence of a substring that matches the regex pattern.
REGEXP_INSTR	Returns the starting or ending position of the matched substring, depending on the option supplied.
REGEXP_COUNT	Returns a count of the number of times that regex pattern occurs in a string.
REGEXP_MATCHES	Returns a table of captured substring(s) that match a regular expression pattern to a string. If no match is found, the function returns no row.
REGEXP_SPLIT_TO_TABLE	Returns a table of strings split, delimited by the regex pattern. If there's no match to the pattern, the function returns the string.

Regular expressions can be composed of literal characters and metacharacters, which have special meanings and functions.

A basic regular expression is a single literal character. Characters match themselves, except for the metacharacters. Metacharacters include *, +, ?, (, ), or |. To match a metacharacter, escape it with a backslash. For example, \* matches the literal asterisk (*) character.

Two regular expressions can be altered or concatenated to form a new regular expression: if e1 matches s and e2 matches t, then e1 | e2 matches s or t, and e1 e2 matches st.

The metacharacters *, +, and ? are repetition operators: e1 * matches a sequence of zero or more (possibly different) strings, each of which match e1; e1 + matches one or more; e1 ? matches zero or one.

Operator precedence, from weakest to strongest binding, is as follows: - Alternation - Concatenation - Repetition operators

Explicit parentheses can be used to force different meanings, as in arithmetic expressions. Some examples: ab|cd is equivalent to (ab)|(cd) ; ab is equivalent to a(b).

For more information about the regular expression syntax accepted by RE2, see
RE2 Regular Expression Syntax. This page also lists some syntax accepted by PCRE, Perl, and Vim.

Accepted regular expression characters

Single character expressions

Kinds of single-character expressions	Examples
any character, possibly including newline (s=true)	`.`
character class	`[xyz]`
negated character class	`[^xyz]`
Perl character classes	`\d`
negated Perl character class	`\D`
ASCII character classes	`[[:alpha:]]`
negated ASCII character class	`[[:^alpha:]]`
Unicode character class (one-letter name)	`\pN`
Unicode character class	`\p{Greek}`
negated Unicode character class (one-letter name)	`\PN`
negated Unicode character class	`\P{Greek}`

Composites	Description
`xy`	x followed by y
`x \| y`	x or y (prefer x)

Repetitions	Description
`x*`	zero or more x, prefer more
`x+`	one or more x, prefer more
`x?`	zero or one x, prefer one
`x{n,m}`	n or n+1 or ... or m x, prefer more
`x{n,}`	n or more x, prefer more
`x{n}`	exactly n x
`x*?`	zero or more x, prefer fewer
`x+?`	one or more x, prefer fewer
`x??`	zero or one x, prefer zero
`x{n,m}?`	n or n+1 or ... or m x, prefer fewer
`x{n,}?`	n or more x, prefer fewer
`x{n}?`	exactly n x

Note

The counting forms x{n,m}, x{n,}, and x{n} reject forms that create a minimum or maximum repetition count above 1,000. Unlimited repetitions aren't subject to this restriction.

Perl character classes

The following table lists currently supported Perl character classes.

Perl character classes (all ASCII-only)	Description
`\d`	digits ( `[0-9]`)
`\D`	not digits ( `[^0-9]`)
`\s`	whitespace ( `[\t\n\f\r ]`)
`\S`	not whitespace ( `[^\t\n\f\r ]`)
`\w`	word characters ( `[0-9A-Za-z_]`)
`\W`	not word characters ( `[^0-9A-Za-z_]`)

ASCII character classes

The following table lists currently supported ASCII character classes.

ASCII character classes	Description
`[[:alnum:]]`	alphanumeric ( `[0-9A-Za-z]`)
`[[:alpha:]]`	alphabetic ( `[A-Za-z]`)
`[[:ascii:]]`	ASCII ( `[\x00-\x7F]`)
`[[:blank:]]`	blank ( `[\t ]`)
`[[:cntrl:]]`	control ( `[\x00-\x1F\x7F]`)
`[[:digit:]]`	digits ( `[0-9]`)
`[[:graph:]]`	graphical ( `[!-~]` or [A-Za-z0-9!"#$%&'()*+,\-./:;<=>?@[\\\]^_`{\|}~])
`[[:lower:]]`	lower case ( `[a-z]`)
`[[:print:]]`	printable ( `[ -~]` or `[ [:graph:]]`)
`[[:punct:]]`	punctuation ( [!-/:-@[-\``{-~])
`[[:space:]]`	whitespace ( `[\t\n\v\f\r ]`)
`[[:upper:]]`	upper case ( `[A-Z]`)
`[[:word:]]`	word characters ( `[0-9A-Za-z_]`)
`[[:xdigit:]]`	hex digit ( `[0-9A-Fa-f]`)

Literal characters

Letters
Numbers
Symbols

Metacharacters

Empty strings	Description
`^`	At beginning of text or line (`m=true`)
`$`	At end of text (like `\z`, not `\Z`) or line (`m=true`)
`\A`	At beginning of text
`\b`	At ASCII word boundary (`\w` on one side and `\W`, `\A`, or `\z` on the other)
`\B`	Not at ASCII word boundary
`\z`	At end of text

Groupings

The capture group (subexpression) of the pattern with:

Grouping	Description
`(re)`	Numbered capturing group (submatch)
`(?P<name>re)`	Named and numbered capturing group (submatch)
`(?:re)`	Non-capturing group
`(?<flags>)`	Set `<flags>` within current group; non-capturing
`(?<flags>:re)`	Set `<flags>` during `re`; non-capturing Grouping

Flags

Use flags to modify the expression behavior. For example:

Flag	Description
`i`	Case-insensitive (default `false`)
`m`	Multi-line mode: `^` and `$` match begin/end line in addition to begin/end text (default `false`)
`s`	Let `.` match `\n` (default `false`)
`c`	Case-sensitive (default `true`)

This implementation supports the POSIX standard of regular expressions following RE2, and has support for the PCRE/PCRE2 flavor of regular expressions syntax, which is compatible with most modern regular expression engines and tools. There are different flavors of regular expressions, such as POSIX, ANSI, Perl, and PCRE, which have different syntax and features. For more information about supported constructs and behavior of the underlying regex engine, see RE2, a regular expression library.

Requirements

A SQL client tool, such as Azure Data Studio, SQL Server Management Studio, or Visual Studio Code.
A basic knowledge of SQL syntax and queries.
A basic understanding of regular expression syntax and concepts.

FAQ

What is the performance impact of using regex?

Regex queries can have a performance impact depending on the complexity of the regex pattern, the size of the text data, and the number of rows involved. You can use the execution plan and statistics to monitor and optimize the performance of your regex queries.

Known Issues, behaviors, and limitations

These are the items that aren't currently supported in this preview:

LOB data types (varchar(max) or nvarchar(max)) for string_expressions are supported in the following intrinsic functions:
- REGEXP_LIKE
- REGEXP_COUNT
- REGEXP_INSTR
- REGEXP_REPLACE
- REGEXP_SUBSTR

However, this support is limited to input sizes up to 2 MB.

LOB data types aren't supported in the regex table-valued functions (TVFs):
- REGEXP_MATCHES
- REGEXP_SPLIT_TO_TABLE
Regular expression functions aren't supported on memory-optimized OLTP tables.

Regular expressions functions (Transact-SQL) preview

Share via

Regular expressions preview

Accepted regular expression characters

Single character expressions

Perl character classes

ASCII character classes

Literal characters

Metacharacters

Groupings

Flags

Requirements

FAQ

What is the performance impact of using regex?

Known Issues, behaviors, and limitations

Related content

Feedback

Additional resources