Is it possible to search by regexp with Symfony Dom crawler?

Question

The Dom Crawler Component is powerfull to parse html content, in its documentation describes basics selections (like filter('body > p')) or more complex xpath like //span[contains(@id, "article-")]

Is it possible to fetch elements by regular expression? Maybe something like that is available: filter('body')->filter('div.*-timeLabel-*') ?

score 1 · Answer 1 · answered Feb 25 '19 at 13:31

1

Something like this? Modified one of the examples from the docs applying a anonymous function.

$nodeValues = $crawler->filter('body')->each(function (Crawler $node, $i) {
    // regex and return $node->attr('class')
});

answered Feb 25 '19 at 13:31

Sondre Edvardsen

182
6

score 0 · Answer 2 · answered Feb 25 '19 at 13:32

i'm not sure but i think the answer is yes cuz the filter method of the crawler calls this method of the CssSelectorConverter and according to the documentation you can pass an expression as a parameter

    /**
     * Translates a CSS expression to its XPath equivalent.
     *
     * Optionally, a prefix can be added to the resulting XPath
     * expression with the $prefix parameter.
     *
     * @param string $cssExpr The CSS expression
     * @param string $prefix  An optional prefix for the XPath expression
     *
     * @return string
     */
    public function toXPath($cssExpr, $prefix = 'descendant-or-self::')
    {
        return $this->translator->cssToXPath($cssExpr, $prefix);
    }

score 0 · Accepted Answer · answered Dec 14 '19 at 14:09

in XPath 2.0, you can use matches:

$crawler->filterXPath("//div[matches(@id, '*-timeLabel-*')]");

but if you don't have that available, your best bet is to try and combine some of the other XPath methods, for example this should do the trick for your case:

$crawler->filterXPath("//div[contains(@id, '*-timeLabel-*')]");

Is it possible to search by regexp with Symfony Dom crawler?

3 Answers3