Struct Captures
pub struct Captures<'h> { /* private fields */ }
dep_regex_lite
only.Expand description
Represents the capture groups for a single match.
Capture groups refer to parts of a regex enclosed in parentheses. They can be optionally named. The purpose of capture groups is to be able to reference different parts of a match based on the original pattern. For example, say you want to match the individual letters in a 5-letter word:
(?<first>\w)(\w)(?:\w)\w(?<last>\w)
This regex has 4 capture groups:
- The group at index
0
corresponds to the overall match. It is always present in every match and never has a name. - The group at index
1
with namefirst
corresponding to the first letter. - The group at index
2
with no name corresponding to the second letter. - The group at index
3
with namelast
corresponding to the fifth and last letter.
Notice that (?:\w)
was not listed above as a capture group despite it
being enclosed in parentheses. That’s because (?:pattern)
is a special
syntax that permits grouping but without capturing. The reason for not
treating it as a capture is that tracking and reporting capture groups
requires additional state that may lead to slower searches. So using as few
capture groups as possible can help performance. (Although the difference
in performance of a couple of capture groups is likely immaterial.)
Values with this type are created by Regex::captures
or
Regex::captures_iter
.
'h
is the lifetime of the haystack that these captures were matched from.
§Example
use regex_lite::Regex;
let re = Regex::new(r"(?<first>\w)(\w)(?:\w)\w(?<last>\w)").unwrap();
let caps = re.captures("toady").unwrap();
assert_eq!("toady", &caps[0]);
assert_eq!("t", &caps["first"]);
assert_eq!("o", &caps[2]);
assert_eq!("y", &caps["last"]);
Implementations§
§impl<'h> Captures<'h>
impl<'h> Captures<'h>
pub fn get(&self, i: usize) -> Option<Match<'h>> ⓘ
pub fn get(&self, i: usize) -> Option<Match<'h>> ⓘ
Returns the Match
associated with the capture group at index i
. If
i
does not correspond to a capture group, or if the capture group did
not participate in the match, then None
is returned.
When i == 0
, this is guaranteed to return a non-None
value.
§Examples
Get the substring that matched with a default of an empty string if the group didn’t participate in the match:
use regex_lite::Regex;
let re = Regex::new(r"[a-z]+(?:([0-9]+)|([A-Z]+))").unwrap();
let caps = re.captures("abc123").unwrap();
let substr1 = caps.get(1).map_or("", |m| m.as_str());
let substr2 = caps.get(2).map_or("", |m| m.as_str());
assert_eq!(substr1, "123");
assert_eq!(substr2, "");
pub fn name(&self, name: &str) -> Option<Match<'h>> ⓘ
pub fn name(&self, name: &str) -> Option<Match<'h>> ⓘ
Returns the Match
associated with the capture group named name
. If
name
isn’t a valid capture group or it refers to a group that didn’t
match, then None
is returned.
Note that unlike caps["name"]
, this returns a Match
whose lifetime
matches the lifetime of the haystack in this Captures
value.
Conversely, the substring returned by caps["name"]
has a lifetime
of the Captures
value, which is likely shorter than the lifetime of
the haystack. In some cases, it may be necessary to use this method to
access the matching substring instead of the caps["name"]
notation.
§Examples
Get the substring that matched with a default of an empty string if the group didn’t participate in the match:
use regex_lite::Regex;
let re = Regex::new(
r"[a-z]+(?:(?<numbers>[0-9]+)|(?<letters>[A-Z]+))",
).unwrap();
let caps = re.captures("abc123").unwrap();
let numbers = caps.name("numbers").map_or("", |m| m.as_str());
let letters = caps.name("letters").map_or("", |m| m.as_str());
assert_eq!(numbers, "123");
assert_eq!(letters, "");
pub fn extract<const N: usize>(&self) -> (&'h str, [&'h str; N]) ⓘ
pub fn extract<const N: usize>(&self) -> (&'h str, [&'h str; N]) ⓘ
This is a convenience routine for extracting the substrings corresponding to matching capture groups.
This returns a tuple where the first element corresponds to the full substring of the haystack that matched the regex. The second element is an array of substrings, with each corresponding to the substring that matched for a particular capture group.
§Panics
This panics if the number of possible matching groups in this
Captures
value is not fixed to N
in all circumstances.
More precisely, this routine only works when N
is equivalent to
Regex::static_captures_len
.
Stated more plainly, if the number of matching capture groups in a regex can vary from match to match, then this function always panics.
For example, (a)(b)|(c)
could produce two matching capture groups
or one matching capture group for any given match. Therefore, one
cannot use extract
with such a pattern.
But a pattern like (a)(b)|(c)(d)
can be used with extract
because
the number of capture groups in every match is always equivalent,
even if the capture indices in each match are not.
§Example
use regex_lite::Regex;
let re = Regex::new(r"([0-9]{4})-([0-9]{2})-([0-9]{2})").unwrap();
let hay = "On 2010-03-14, I became a Tenneessee lamb.";
let Some((full, [year, month, day])) =
re.captures(hay).map(|caps| caps.extract()) else { return };
assert_eq!("2010-03-14", full);
assert_eq!("2010", year);
assert_eq!("03", month);
assert_eq!("14", day);
§Example: iteration
This example shows how to use this method when iterating over all
Captures
matches in a haystack.
use regex_lite::Regex;
let re = Regex::new(r"([0-9]{4})-([0-9]{2})-([0-9]{2})").unwrap();
let hay = "1973-01-05, 1975-08-25 and 1980-10-18";
let mut dates: Vec<(&str, &str, &str)> = vec![];
for (_, [y, m, d]) in re.captures_iter(hay).map(|c| c.extract()) {
dates.push((y, m, d));
}
assert_eq!(dates, vec![
("1973", "01", "05"),
("1975", "08", "25"),
("1980", "10", "18"),
]);
§Example: parsing different formats
This API is particularly useful when you need to extract a particular value that might occur in a different format. Consider, for example, an identifier that might be in double quotes or single quotes:
use regex_lite::Regex;
let re = Regex::new(r#"id:(?:"([^"]+)"|'([^']+)')"#).unwrap();
let hay = r#"The first is id:"foo" and the second is id:'bar'."#;
let mut ids = vec![];
for (_, [id]) in re.captures_iter(hay).map(|c| c.extract()) {
ids.push(id);
}
assert_eq!(ids, vec!["foo", "bar"]);
pub fn expand(&self, replacement: &str, dst: &mut String)
pub fn expand(&self, replacement: &str, dst: &mut String)
Expands all instances of $ref
in replacement
to the corresponding
capture group, and writes them to the dst
buffer given. A ref
can
be a capture group index or a name. If ref
doesn’t refer to a capture
group that participated in the match, then it is replaced with the
empty string.
§Format
The format of the replacement string supports two different kinds of capture references: unbraced and braced.
For the unbraced format, the format supported is $ref
where name
can be any character in the class [0-9A-Za-z_]
. ref
is always
the longest possible parse. So for example, $1a
corresponds to the
capture group named 1a
and not the capture group at index 1
. If
ref
matches ^[0-9]+$
, then it is treated as a capture group index
itself and not a name.
For the braced format, the format supported is ${ref}
where ref
can
be any sequence of bytes except for }
. If no closing brace occurs,
then it is not considered a capture reference. As with the unbraced
format, if ref
matches ^[0-9]+$
, then it is treated as a capture
group index and not a name.
The braced format is useful for exerting precise control over the name
of the capture reference. For example, ${1}a
corresponds to the
capture group reference 1
followed by the letter a
, where as $1a
(as mentioned above) corresponds to the capture group reference 1a
.
The braced format is also useful for expressing capture group names
that use characters not supported by the unbraced format. For example,
${foo[bar].baz}
refers to the capture group named foo[bar].baz
.
If a capture group reference is found and it does not refer to a valid capture group, then it will be replaced with the empty string.
To write a literal $
, use $$
.
§Example
use regex_lite::Regex;
let re = Regex::new(
r"(?<day>[0-9]{2})-(?<month>[0-9]{2})-(?<year>[0-9]{4})",
).unwrap();
let hay = "On 14-03-2010, I became a Tenneessee lamb.";
let caps = re.captures(hay).unwrap();
let mut dst = String::new();
caps.expand("year=$year, month=$month, day=$day", &mut dst);
assert_eq!(dst, "year=2010, month=03, day=14");
pub fn iter<'c>(&'c self) -> SubCaptureMatches<'c, 'h> ⓘ
pub fn iter<'c>(&'c self) -> SubCaptureMatches<'c, 'h> ⓘ
Returns an iterator over all capture groups. This includes both matching and non-matching groups.
The iterator always yields at least one matching group: the first group
(at index 0
) with no name. Subsequent groups are returned in the order
of their opening parenthesis in the regex.
The elements yielded have type Option<Match<'h>>
, where a non-None
value is present if the capture group matches.
§Example
use regex_lite::Regex;
let re = Regex::new(r"(\w)(\d)?(\w)").unwrap();
let caps = re.captures("AZ").unwrap();
let mut it = caps.iter();
assert_eq!(it.next().unwrap().map(|m| m.as_str()), Some("AZ"));
assert_eq!(it.next().unwrap().map(|m| m.as_str()), Some("A"));
assert_eq!(it.next().unwrap().map(|m| m.as_str()), None);
assert_eq!(it.next().unwrap().map(|m| m.as_str()), Some("Z"));
assert_eq!(it.next(), None);
pub fn len(&self) -> usize ⓘ
pub fn len(&self) -> usize ⓘ
Returns the total number of capture groups. This includes both matching and non-matching groups.
The length returned is always equivalent to the number of elements
yielded by Captures::iter
. Consequently, the length is always
greater than zero since every Captures
value always includes the
match for the entire regex.
§Example
use regex_lite::Regex;
let re = Regex::new(r"(\w)(\d)?(\w)").unwrap();
let caps = re.captures("AZ").unwrap();
assert_eq!(caps.len(), 4);
Trait Implementations§
§impl<'h, 'n> Index<&'n str> for Captures<'h>
Get a matching capture group’s haystack substring by name.
impl<'h, 'n> Index<&'n str> for Captures<'h>
Get a matching capture group’s haystack substring by name.
The haystack substring returned can’t outlive the Captures
object if this
method is used, because of how Index
is defined (normally a[i]
is part
of a
and can’t outlive it). To work around this limitation, do that, use
Captures::get
instead.
'h
is the lifetime of the matched haystack, but the lifetime of the
&str
returned by this implementation is the lifetime of the Captures
value itself.
'n
is the lifetime of the group name used to index the Captures
value.
§Panics
If there is no matching group at the given name.
§impl<'h> Index<usize> for Captures<'h>
Get a matching capture group’s haystack substring by index.
impl<'h> Index<usize> for Captures<'h>
Get a matching capture group’s haystack substring by index.
The haystack substring returned can’t outlive the Captures
object if this
method is used, because of how Index
is defined (normally a[i]
is part
of a
and can’t outlive it). To work around this limitation, do that, use
Captures::get
instead.
'h
is the lifetime of the matched haystack, but the lifetime of the
&str
returned by this implementation is the lifetime of the Captures
value itself.
§Panics
If there is no matching group at the given index.
Auto Trait Implementations§
impl<'h> Freeze for Captures<'h>
impl<'h> RefUnwindSafe for Captures<'h>
impl<'h> Send for Captures<'h>
impl<'h> Sync for Captures<'h>
impl<'h> Unpin for Captures<'h>
impl<'h> UnwindSafe for Captures<'h>
Blanket Implementations§
§impl<T> ArchivePointee for T
impl<T> ArchivePointee for T
§type ArchivedMetadata = ()
type ArchivedMetadata = ()
§fn pointer_metadata(
_: &<T as ArchivePointee>::ArchivedMetadata,
) -> <T as Pointee>::Metadata
fn pointer_metadata( _: &<T as ArchivePointee>::ArchivedMetadata, ) -> <T as Pointee>::Metadata
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Source§impl<T> ByteSized for T
impl<T> ByteSized for T
Source§const BYTE_ALIGN: usize = _
const BYTE_ALIGN: usize = _
Source§fn byte_align(&self) -> usize ⓘ
fn byte_align(&self) -> usize ⓘ
Source§fn ptr_size_ratio(&self) -> [usize; 2]
fn ptr_size_ratio(&self) -> [usize; 2]
Source§impl<T, R> Chain<R> for Twhere
T: ?Sized,
impl<T, R> Chain<R> for Twhere
T: ?Sized,
Source§impl<T> ExtAny for T
impl<T> ExtAny for T
Source§fn as_any_mut(&mut self) -> &mut dyn Anywhere
Self: Sized,
fn as_any_mut(&mut self) -> &mut dyn Anywhere
Self: Sized,
Source§impl<T> ExtMem for Twhere
T: ?Sized,
impl<T> ExtMem for Twhere
T: ?Sized,
Source§const NEEDS_DROP: bool = _
const NEEDS_DROP: bool = _
Source§fn mem_align_of_val(&self) -> usize ⓘ
fn mem_align_of_val(&self) -> usize ⓘ
Source§fn mem_size_of_val(&self) -> usize ⓘ
fn mem_size_of_val(&self) -> usize ⓘ
Source§fn mem_needs_drop(&self) -> bool
fn mem_needs_drop(&self) -> bool
true
if dropping values of this type matters. Read moreSource§fn mem_forget(self)where
Self: Sized,
fn mem_forget(self)where
Self: Sized,
self
without running its destructor. Read moreSource§fn mem_replace(&mut self, other: Self) -> Selfwhere
Self: Sized,
fn mem_replace(&mut self, other: Self) -> Selfwhere
Self: Sized,
Source§unsafe fn mem_zeroed<T>() -> T
unsafe fn mem_zeroed<T>() -> T
unsafe_layout
only.T
represented by the all-zero byte-pattern. Read moreSource§unsafe fn mem_transmute_copy<Src, Dst>(src: &Src) -> Dst
unsafe fn mem_transmute_copy<Src, Dst>(src: &Src) -> Dst
unsafe_layout
only.T
represented by the all-zero byte-pattern. Read moreSource§fn mem_as_bytes(&self) -> &[u8] ⓘ
fn mem_as_bytes(&self) -> &[u8] ⓘ
unsafe_slice
only.§impl<S> FromSample<S> for S
impl<S> FromSample<S> for S
fn from_sample_(s: S) -> S
Source§impl<T> Hook for T
impl<T> Hook for T
§impl<T> Instrument for T
impl<T> Instrument for T
§fn instrument(self, span: Span) -> Instrumented<Self> ⓘ
fn instrument(self, span: Span) -> Instrumented<Self> ⓘ
§fn in_current_span(self) -> Instrumented<Self> ⓘ
fn in_current_span(self) -> Instrumented<Self> ⓘ
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self> ⓘ
fn into_either(self, into_left: bool) -> Either<Self, Self> ⓘ
self
into a Left
variant of Either<Self, Self>
if into_left
is true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self> ⓘ
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self> ⓘ
self
into a Left
variant of Either<Self, Self>
if into_left(&self)
returns true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read more§impl<F, T> IntoSample<T> for Fwhere
T: FromSample<F>,
impl<F, T> IntoSample<T> for Fwhere
T: FromSample<F>,
fn into_sample(self) -> T
§impl<T> LayoutRaw for T
impl<T> LayoutRaw for T
§fn layout_raw(_: <T as Pointee>::Metadata) -> Result<Layout, LayoutError> ⓘ
fn layout_raw(_: <T as Pointee>::Metadata) -> Result<Layout, LayoutError> ⓘ
§impl<T, N1, N2> Niching<NichedOption<T, N1>> for N2
impl<T, N1, N2> Niching<NichedOption<T, N1>> for N2
§unsafe fn is_niched(niched: *const NichedOption<T, N1>) -> bool
unsafe fn is_niched(niched: *const NichedOption<T, N1>) -> bool
§fn resolve_niched(out: Place<NichedOption<T, N1>>)
fn resolve_niched(out: Place<NichedOption<T, N1>>)
out
indicating that a T
is niched.