Negation Regex I need help with \b

Question

Here is a reminder of Kenneth's code:
```python
import re
namesfile = open('names.txt', encoding = 'utf-8')
data = namesfile.read()
names_file.close()
print(re.findall(r'''
    \b@[-\w\d.]*
    [^gov	]+
    \b
''', data, re.X|re.I))
```
This gives a result of the @ part of email addresses, excluding "gov", a few examples here:
['@teamtreehouse.com', '@camelot.co.uk', '@spain.']
I have tried taking out the two '\b' and it gives the result:
['@kennethlove
McFarland, ', '@potus44
Chalkey, Andrew']
I don't understand why taking out the \b will mean that I find combinations including '
', ',' and ' ' - none of those are contained in my @[-\w.+]*, so why would those appear?

Saikat Chowdhury · Accepted Answer

Hi Flore,
 /b is looking for word boundary or edges of the word define by white spaces. Here It will check start and end of the string.
my thinking is ,  when you remove two \b from your regex script then it started searching twitter records. And Twitter record has @ . Because of + sign it goes to next line . And you are getting below result 
eg. ['@kennethlove
McFarland, ', '@potus44
Chalkey, Andrew']  
if you remove + sign from  [^gov	]\b  then  it will stay on same line, even you remove \b from your regex script.
Regards,
Saikat

Welcome to the Treehouse Community

Looking to learn something new?

Flore W

Flore W

Negation Regex I need help with \b

1 Answer

Saikat Chowdhury

Saikat Chowdhury