Welcome to the Treehouse Community

Posted September 18, 2018 10:08pm by Michael Poehner

'NoneType' object is not subscriptable

I am not sure why this code throws a 'NoneType' object is not subscriptable error in my Python web scraper:

```python
from urllib.request import urlopen
from bs4 import BeautifulSoup

import re


def internal_links(linkURL):
    html = urlopen('https://treehouse-projects.github.io/horse-land/{}'.format(linkURL))
    soup = BeautifulSoup(html, 'html.parser')

    return soup.find('a', href=re.compile('(.html)$'))


if __name__ == '__main__':
    urls = internal_links('index.html')
    while len(urls) > 0:
        page = urls.attr['href']

        print(page)
        print('\n===============\n')
        urls = internal_links(page)
```

February 18, 2019 7:03am

I have an additional question: is href in the line below just a keyword argument made up on the spot? href isn't a predefined parameter of BeautifulSoup's find() method, so am I understanding that right?

return soup.find('a', href=re.compile('(.html)$'))
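For anyone with the same question: as far as I understand it, find() has a few named parameters (such as name and attrs), and any other keyword argument is collected and treated as a filter on the HTML attribute of that name. So href is not predefined, it just becomes an attribute filter. Here is a minimal sketch of that, assuming bs4 is installed; the inline HTML and the variable names are made up for illustration and are not from the original post.

```python
import re
from bs4 import BeautifulSoup

# Hypothetical inline HTML standing in for a fetched page.
html = '<a href="/about">About</a><a href="mustang.html">Mustang</a>'
soup = BeautifulSoup(html, 'html.parser')

# Escaping the dot makes the pattern match a literal ".html" suffix.
pattern = re.compile(r'\.html$')

# Keyword form: href is not a named parameter of find(),
# so it is used as a filter on the tag's href attribute.
by_kwarg = soup.find('a', href=pattern)

# Explicit form: the same filter passed through the attrs dictionary.
by_attrs = soup.find('a', attrs={'href': pattern})

print(by_kwarg['href'])
print(by_kwarg == by_attrs)
```

The attrs form is handy when the attribute name clashes with a Python keyword or one of find()'s own parameter names.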

1 Answer

September 18, 2018 10:27pm

Sorry, I just figured it out: I wrote urls.attr instead of urls.attrs.
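A short sketch of why that typo produces this exact message, assuming bs4 is installed. On a BeautifulSoup tag, an unknown attribute name such as .attr is treated as a search for a child tag with that name, which gives None when there is no such child. Subscripting None then raises the TypeError. The sample markup and variable names below are hypothetical.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for one link on the scraped page.
soup = BeautifulSoup('<a href="mustang.html">Mustang</a>', 'html.parser')
link = soup.find('a')

# .attr is not a Tag property; bs4 looks for a child <attr> tag and finds none.
print(link.attr)

# .attrs is the dictionary of the tag's HTML attributes.
print(link.attrs['href'])

# Subscripting the tag directly is a shorthand for the same lookup.
print(link['href'])

# find() itself returns None when nothing matches, so check before subscripting.
missing = soup.find('a', href='nowhere.html')
if missing is None:
    print('no matching link')
```

The same None check is probably worth adding to the loop in the question, since find() returns None once a page has no matching link and len(None) would fail as well.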

Thread participants: Michael Poehner (question and answer), Mark Chesney (follow-up comment).