Regex - find word in string

Question

I have a string: prawy p pęknięty p zderzak pęknięcie (it's in Polish)

I want to select every p except the "p" in the words "pęknięty" and "pęknięcie".

I've tried something like this: \b(s*ps*)\b, but it doesn't work properly. Any ideas?

So, all `p` letters at the beginning of words but those two specific words? Try `\bp(?!ęknię(?:ty|cie)\b)` — Wiktor Stribiżew, Oct 10 '19 at 14:26

score 0 · Answer 1 · edited Jun 20 '20 at 09:12

Maybe,

\bp(?=[a-z]+|\s|$)

or

(?!pęknięcie|pęknięty)\bp

might simply work fine.
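As a quick check, here is a minimal Python sketch that runs both expressions against the sample string from the question. It assumes Python 3's built-in `re` module, where `\b` is Unicode-aware for `str` patterns, and it assumes the text uses the precomposed character ę. The variable names are only illustrative.

```python
import re

text = "prawy p pęknięty p zderzak pęknięcie"

# The two expressions suggested above, written as raw strings.
pattern_a = re.compile(r"\bp(?=[a-z]+|\s|$)")
pattern_b = re.compile(r"(?!pęknięcie|pęknięty)\bp")

# Collect the start offset of every matched p.
starts_a = [m.start() for m in pattern_a.finditer(text)]
starts_b = [m.start() for m in pattern_b.finditer(text)]

print(starts_a)
print(starts_b)
```

Both lists should contain the p of "prawy" and the two standalone p letters. Note that the first expression only skips "pęknięty" and "pęknięcie" because ę falls outside `[a-z]`. It would also skip the p in any other word whose second letter is non-ASCII (for example "pęd"), and it would match the p in a misspelling such as "peknięty".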

Demo 1

Demo 2

If you wish to simplify, modify, or explore the expression, regex101.com explains it in the top-right panel. The linked demo also shows how it matches against some sample inputs.

RegEx Circuit

jex.im visualizes regular expressions.

answered Oct 10 '19 at 14:13 by Emma · edited Jun 20 '20 at 09:12 by Community

The fourth bird · Answer 2 · 2019-10-10T17:23:33.043

You might use a negative lookahead and a character class:

\bp(?![eę]knię(?:cie|ty)\b)

In parts:

\bp Match p preceded by a word boundary
(?! Negative lookahead: assert that what is directly to the right is not
- [eę]knię e or ę followed by knię
- (?:cie|ty)\b cie or ty followed by a word boundary
) Close the negative lookahead
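The breakdown above can be sketched in Python with `re.VERBOSE`, which lets each part carry its own comment. This is a minimal sketch, assuming Python 3's `re` module and the pattern written with balanced parentheses. The `[p]` marker used in the substitution is only an illustration of which letters are selected.

```python
import re

# Same lookahead as described above, one commented part per line.
pattern = re.compile(
    r"""
    \bp              # p preceded by a word boundary
    (?!              # negative lookahead: what follows must not be
      [eę]knię       #   e or ę followed by knię
      (?:cie|ty)\b   #   then cie or ty and a word boundary
    )                # close the negative lookahead
    """,
    re.VERBOSE,
)

text = "prawy p pęknięty p zderzak pęknięcie"

# Start offsets of the selected p letters.
starts = [m.start() for m in pattern.finditer(text)]

# Mark every selected p so the result is easy to read.
marked = pattern.sub("[p]", text)
print(starts)
print(marked)
```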

Regex demo

Note that the character class [eę] also accepts spellings that are not real words, such as peknięty with a plain e.

To exclude only the exact words, spell them out in the lookahead, each followed by a word boundary:

\bp(?!ęknięty\b|ęknięcie\b)
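To see the difference between the two variants, the following Python sketch compares the exact-word lookahead with the character-class version. It assumes Python 3's `re` module; the helper `starts` and the extra test strings ("peknięty p" and the inflected form "pękniętych") are illustrative additions, not part of the original question.

```python
import re

# Exact words in the lookahead versus the looser character-class variant.
exact = re.compile(r"\bp(?!ęknięty\b|ęknięcie\b)")
loose = re.compile(r"\bp(?![eę]knię(?:cie|ty)\b)")

def starts(pattern, text):
    """Return the start offset of every match of pattern in text."""
    return [m.start() for m in pattern.finditer(text)]

sample = "prawy p pęknięty p zderzak pęknięcie"
misspelled = "peknięty p"   # plain e instead of ę
inflected = "pękniętych"    # longer word that starts with pęknięty

for name, pattern in (("exact", exact), ("loose", loose)):
    print(name, starts(pattern, sample), starts(pattern, misspelled))

# The trailing \b means only the whole words are excluded.
print(starts(exact, inflected))
```

On the sample string both variants select the same three letters. They differ on the misspelled input: the exact version selects the p in "peknięty", while the character-class version skips it. The trailing `\b` keeps the exclusion limited to whole words, so the p in a longer word such as "pękniętych" is still selected.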

Regex demo
