Outputting <title></title> content from HTML

Question

Original Question: "Writing a program that reads from a public web page and looks for the title of the page. The program should then output the contents of the page title to STDOUT using an ERB template."

My problem: I've been able to retrieve the HTML of a webpage and save it to a txt file; however, i'm a bit confused as to how I can get the program to output the content of the <title> element. I know that I can find the <title></title> tags using .include?(), /title/, etc., but when it comes to actually outputting the content, i'm a bit lost.

Without telling me the answer, would someone mind maybe pointing me in the right direction or giving me a hint?

In other words, if the title is

            <title>My Name</title>

How can I get ruby to search the .txt file and spit out "My Name"?

Thank you!

Answer 1 · 2013-12-23T20:24:36Z

December 23, 2013 8:24pm

I don't know Ruby myself, but my best guess would be to use regular expressions. Something like <[^>]+> allows you to find opening and closing tags which you can then exclude.

[edit]

This is probably better for your purpose:

(?<=(<title>))[^<]+

That should match anything in between two title tags.

[edit 2] This also works in case there are different tags inside the tag you're searching:

(?<=(<title>)).*(?=</title>)

Answer 2 · 2013-12-24T01:02:24Z

December 24, 2013 1:02am

Thank you, Niklas. I guess I just didn't know how to write the right regexp.

Welcome to the Treehouse Community

Looking to learn something new?

max g

max g

Outputting <title></title> content from HTML

2 Answers

sdfasdfagasdgasdg

sdfasdfagasdgasdg

max g

max g