Target Text Extractor - Text Pattern Searching and Extracting Online Tool

TARGET TEXT EXTRACTOR

This tool scans the provided input line by line for a predefined character pattern and outputs the matches, or alternatively the whole lines that do (positive) or do not (negative) contain a match.
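The line-by-line scan described above can be sketched roughly as follows. This is only an illustration, not the tool's actual code: the function name scanLines and the mode names are hypothetical, and the pattern is passed as a regular expression source string.

```javascript
// Hypothetical helper (not the tool's real implementation): scan text line by
// line and return the matched substrings, the whole matching lines, or the
// whole non-matching lines, depending on the mode.
function scanLines(text, pattern, mode = "matches") {
  const out = [];
  for (const line of text.split(/\r?\n/)) {
    // A fresh global regex per line collects every match on that line.
    const found = line.match(new RegExp(pattern, "g")) || [];
    if (mode === "matches") out.push(...found);
    else if (mode === "matchingLines" && found.length > 0) out.push(line);
    else if (mode === "nonMatchingLines" && found.length === 0) out.push(line);
  }
  return out;
}

const sample = "call 555-1234\nno number here\nfax 555-9876 or 555-0000";
const matches = scanLines(sample, "\\d{3}-\\d{4}");
// ["555-1234", "555-9876", "555-0000"]
const positive = scanLines(sample, "\\d{3}-\\d{4}", "matchingLines");
// ["call 555-1234", "fax 555-9876 or 555-0000"]
const negative = scanLines(sample, "\\d{3}-\\d{4}", "nonMatchingLines");
// ["no number here"]
```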

Text patterns can be defined in the following ways:

Enter the text that precedes or follows (or both) the desired character strings. Put a backslash before special characters (.+*?|^$()[]{}\).
Enter a JavaScript regular expression defining a character pattern that precedes or follows (or both) the desired character strings.
Define the text pattern to be searched for directly as a JavaScript regular expression. For example, to extract U.S. phone numbers in the format XXX-XXXX, the expression would be "\d{3}-\d{4}".
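The three ways of defining a pattern might look like this in plain JavaScript. This is a minimal sketch under assumptions: escapeRegExp and extractBetween are hypothetical helper names, the sample line is made up, and the sketch expects both a preceding and a following delimiter.

```javascript
// Put a backslash before every regex special character so that the text is
// matched literally.
function escapeRegExp(text) {
  return text.replace(/[.+*?|^$()[\]{}\\]/g, "\\$&");
}

// Hypothetical helper: capture the shortest text found between two
// delimiters, both given as regex source strings. Assumes neither is empty.
function extractBetween(line, before, after) {
  const re = new RegExp(before + "(.*?)" + after, "g");
  return Array.from(line.matchAll(re), (m) => m[1]);
}

const priceLine = "price (USD): 42.50; price (USD): 7.00;";

// 1. Literal preceding/following text, with special characters escaped.
const viaLiteral = extractBetween(priceLine, escapeRegExp("(USD): "), escapeRegExp(";"));
// ["42.50", "7.00"]

// 2. Preceding/following text described by regular expressions.
const viaRegex = extractBetween(priceLine, "\\(\\w{3}\\):\\s", ";");
// ["42.50", "7.00"]

// 3. The target itself defined as a regular expression.
const phones = "home 555-1234, work 555-9876".match(/\d{3}-\d{4}/g);
// ["555-1234", "555-9876"]
```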


REGULAR EXPRESSIONS

Metacharacters

Metacharacters are characters with a special meaning:

.	Find a single character, except newline or line terminator
\w	Find a word character (a letter, a digit or an underscore)
\W	Find a non-word character
\d	Find a digit
\D	Find a non-digit character
\s	Find a whitespace character
\S	Find a non-whitespace character
\b	Find a match at the beginning/end of a word
\B	Find a match not at the beginning/end of a word
\t	Find a tab character
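A few of the metacharacters above, tried on a made-up sample string (the variable names are illustrative only):

```javascript
// Sample text containing a tab character between "shipped" and "to".
const order = "Order_7 shipped\tto Bay 12";

const digits = order.match(/\d/g);      // ["7", "1", "2"]
const words = order.match(/\w+/g);      // ["Order_7", "shipped", "to", "Bay", "12"]
const chunks = order.match(/\S+/g);     // ["Order_7", "shipped", "to", "Bay", "12"]
const wholeWord = /\bto\b/.test(order); // true: "to" stands alone as a word
const inside = /\Bto\B/.test("stone");  // true: "to" sits inside a word
const hasTab = /\t/.test(order);        // true
```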

Brackets

Brackets are used to find a range of characters:

[abc]	Find any character between the brackets
[^abc]	Find any character NOT between the brackets
[0-4]	Find any digit from 0 to 4
[^3-6]	Find any character that is NOT a digit from 3 to 6
(x|y)	Find any of the alternatives specified
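The bracket expressions above can be tried on a short made-up string. Note that a negated range such as [^3-6] also matches letters, not only the remaining digits.

```javascript
const chars = "a1b5c9";

const inSet = chars.match(/[abc]/g);      // ["a", "b", "c"]
const notInSet = chars.match(/[^abc]/g);  // ["1", "5", "9"]
const lowDigits = chars.match(/[0-4]/g);  // ["1"]
const not3to6 = chars.match(/[^3-6]/g);   // ["a", "1", "b", "c", "9"]
const either = "cat dog bird".match(/(cat|bird)/g); // ["cat", "bird"]
```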

Quantifiers

n+	Matches one or more consecutive n's
n*	Matches zero or more consecutive n's
n?	Matches zero or one n
n{X}	Matches a sequence of exactly X n's
n{X,Y}	Matches a sequence of X to Y n's
n{X,}	Matches a sequence of at least X n's
n$	Matches n at the end of the string
^n	Matches n at the beginning of the string
n(?=x)	Matches n only if it is followed by x (x is not part of the match)
n(?!x)	Matches n only if it is not followed by x
n+?, n*?, n??	Lazy match (shortest possible) instead of the default greedy match (longest possible)
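The difference between greedy and lazy matching, and the effect of the lookahead forms, is easiest to see on small made-up inputs (all names below are illustrative):

```javascript
const tagged = "[one] and [two]";

// Greedy: .* runs to the last closing bracket.
const greedy = tagged.match(/\[.*\]/)[0];   // "[one] and [two]"
// Lazy: .*? stops at the first closing bracket.
const lazy = tagged.match(/\[.*?\]/)[0];    // "[one]"

// Counted repetition: whole numbers with two or three digits.
const counted = "7 42 123 4567".match(/\b\d{2,3}\b/g); // ["42", "123"]

// Anchors at the beginning and end of the string.
const atStart = /^\[one/.test(tagged); // true
const atEnd = /two\]$/.test(tagged);   // true

const weights = "10kg 20lb 30kg";
// Positive lookahead: numbers followed by "kg"; "kg" is not in the match.
const followed = weights.match(/\d+(?=kg)/g);       // ["10", "30"]
// Negative lookahead: \d is also excluded so that backtracking cannot
// return a partial number such as "1" from "10kg".
const notFollowed = weights.match(/\d+(?!\d|kg)/g); // ["20"]
```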


SIMPLE AND ADVANCED MODE

In "simple mode", any text entered in the text pattern definition fields (PRECEDED BY and FOLLOWED BY) is searched for "as is". In "advanced mode", the text patterns are interpreted as regular expressions, so this mode is intended for users who are familiar with them.
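One plausible way to picture the two modes is shown below. How the tool implements them internally is not documented here, so buildPattern and its parameters are hypothetical.

```javascript
// Hypothetical sketch: in simple mode the input is escaped so that it is
// matched literally; in advanced mode it is used as a regular expression.
function buildPattern(input, advanced) {
  const source = advanced
    ? input
    : input.replace(/[.+*?|^$()[\]{}\\]/g, "\\$&");
  return new RegExp(source, "g");
}

const equation = "a+b equals aab";
const simpleHits = equation.match(buildPattern("a+b", false));  // ["a+b"]
const advancedHits = equation.match(buildPattern("a+b", true)); // ["aab"]
```

The same input "a+b" finds the literal text in simple mode, but in advanced mode it means "one or more a followed by b" and therefore finds "aab".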


TEXT TO EXTRACT

An extraction pattern not found among the drop-down menu options has to be provided by the user (after selecting one of the last two options). The pattern can be composed of normal characters and reserved characters with a special meaning (.+*?|^$()[]{}\). To have a reserved character interpreted literally, put a backslash before it, e.g. "\." matches a dot itself rather than "any character", which is the meaning of the unescaped special character "." (dot). To display an overview of special characters, click the "Regex Syntax" button.
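The effect of escaping the dot can be checked on a small made-up input:

```javascript
const values = "3.14 3x14 3-14";

// Unescaped dot: any character is accepted between "3" and "14".
const anyChar = values.match(/3.14/g);     // ["3.14", "3x14", "3-14"]
// Escaped dot: only a literal "." is accepted.
const literalDot = values.match(/3\.14/g); // ["3.14"]
```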



