How do you include negations in grouping? I'm pretty sure that's the reason my code hasn't been accepted.

Question

I'm being told I don't have the right regex. The request was to create two groups named email and phone, using search, with a pattern that omits the comma and space. I assume I should be able to use a negation here too. I've tried writing this in verbose format but seem to have issues with my formatting. Do you have to indent a specific distance from the left edge for this to work?

emails.py

import re

string = '''Love, Kenneth, kenneth+challenge@teamtreehouse.com, 555-555-5555, @kennethlove
Chalkley, Andrew, andrew@teamtreehouse.co.uk, 555-555-5556, @chalkers
McFarland, Dave, dave.mcfarland@teamtreehouse.com, 555-555-5557, @davemcfarland
Kesten, Joy, joy@teamtreehouse.com, 555-555-5558, @joykesten'''

contacts = re.search(r'(?P<email>\b[-\d\w.+]+@[-\d\w.]+)([,]\s)(?P<phone>\d{3}-\d{3}-\d{4})',string,re.I)

Answer 1 · 2019-01-15T01:58:06Z

To answer your first question, you include negations as normal in grouping ([^characters go here]). Perhaps the reason it did not work for you is that you were including .+ after the negated set. A negated set matches any single character that is not in the set, so with a + after it, it already consumes everything up to the next excluded character, and the extra .+ is not necessary. The (unnamed) group for the email part would look like this:

([^ ,]+@.+\.\w+)
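
As a quick check of how the negated set behaves, here is a minimal sketch run against the first row of the data from the question. The variable names `line`, `local_part`, and `email` are only for illustration and are not part of the challenge.

```python
import re

# First row of the challenge data, copied from the question.
line = "Love, Kenneth, kenneth+challenge@teamtreehouse.com, 555-555-5555, @kennethlove"

# [^ ,] matches ONE character that is neither a space nor a comma.
# The + repeats it, so the run cannot cross a space or a comma.
local_part = re.search(r'[^ ,]+@', line)
print(local_part.group())

# The same negated set inside a capturing group, as in the pattern above.
email = re.search(r'([^ ,]+@.+\.\w+)', line)
print(email.group(1))
```

The first search stops at the @ because "Love" and "Kenneth" are followed by a comma, not an @, so the only run that can match is the one directly before the @.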

Answer 2 · 2019-01-17T01:02:44Z

contacts = re.search(r'(?P<email>[^ ,]+@.+\.\w+).+(?P<phone>(\d{3}-){2}\d{4})', string)

What I meant was that the pattern I gave you was the pattern you needed, except that I did not name the group using (?P<email>...). The group that you made for the phone number appears correct; only the email group needed fixing.

To exclude the comma and space, you should add [^ ,] (there is a space before the comma). Remember that this matches any one character that is not a space or a comma, so you have to add the + after it to get the part of the email that is before the @ symbol. Also, instead of listing the characters that you needed after the @ symbol manually and surrounding them with brackets, I just used .+ to get everything up to the last period, followed by \.\w+ for the final part of the domain.
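
To see what that pattern captures, here is a small sketch that runs it against the string from the question. It assumes nothing beyond the standard re module; whether the challenge checker accepts it is not something this snippet can confirm.

```python
import re

string = '''Love, Kenneth, kenneth+challenge@teamtreehouse.com, 555-555-5555, @kennethlove
Chalkley, Andrew, andrew@teamtreehouse.co.uk, 555-555-5556, @chalkers
McFarland, Dave, dave.mcfarland@teamtreehouse.com, 555-555-5557, @davemcfarland
Kesten, Joy, joy@teamtreehouse.com, 555-555-5558, @joykesten'''

# Raw string so that \. \w and \d reach the regex engine unchanged.
contacts = re.search(
    r'(?P<email>[^ ,]+@.+\.\w+).+(?P<phone>(\d{3}-){2}\d{4})',
    string,
)

# search returns only the first match, which is on the first row.
print(contacts.group('email'))
print(contacts.group('phone'))
print(contacts.groupdict())
```

Note that (\d{3}-) is itself a capturing group, so contacts.groups() has three entries even though groupdict() has only the two named ones. Writing it as (?:\d{3}-) would make it non-capturing if only two groups are wanted.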

Answer 3 · 2019-03-22T22:30:58Z

I was confused by this as well. I used the comma and space to separate the two groups and it worked for me:

contacts = re.search(r'(?P<email>[-\w\d+.]+@[-\w\d+.]+), (?P<phone>\d{3}-\d{3}-\d{4})', string)
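
On the verbose-format part of the original question: with re.VERBOSE, whitespace inside the pattern is ignored (except inside a character class or when escaped), so no particular indentation is required. The catch is that a literal space in the pattern is ignored too, so the ", " separator has to be written as ,\s or with an escaped space. A sketch of the pattern above in verbose form, where the name `pattern` is just illustrative:

```python
import re

string = '''Love, Kenneth, kenneth+challenge@teamtreehouse.com, 555-555-5555, @kennethlove
Chalkley, Andrew, andrew@teamtreehouse.co.uk, 555-555-5556, @chalkers
McFarland, Dave, dave.mcfarland@teamtreehouse.com, 555-555-5557, @davemcfarland
Kesten, Joy, joy@teamtreehouse.com, 555-555-5558, @joykesten'''

# Indentation and line breaks are free-form in verbose mode.
pattern = re.compile(r'''
    (?P<email>[-\w\d+.]+@[-\w\d+.]+)   # email: local part, @, domain
    ,\s                                # separator: a bare space would be ignored here
    (?P<phone>\d{3}-\d{3}-\d{4})       # phone in the form 555-555-5555
''', re.VERBOSE)

contacts = pattern.search(string)
print(contacts.group('email'))
print(contacts.group('phone'))
```

Here \s stands in for the literal space; it also matches tabs and newlines, which makes no difference for this data.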

David Gahagan

Eduardo Valencia

William Ennals