w3resource

Python BeautifulSoup: Print the names of all HTML tags of a given web page going through the document tree

BeautifulSoup: Exercise-13 with Solution

Write a Python program to print the names of all HTML tags of a given web page going through the document tree.

Sample Solution:

Python Code:

import requests
from bs4 import BeautifulSoup
url = 'https://www.python.org/'
reqs = requests.get(url)
soup = BeautifulSoup(reqs.text, 'lxml')
print("\nNames of all HTML tags (https://www.python.org):\n")
for child in soup.recursiveChildGenerator():
    if child.name:
        print(child.name)

Sample Output:

Names of all HTML tags (https://www.python.org):

html
head
meta
meta
link
meta
meta
meta
meta
meta
meta
meta
meta
meta
meta
script
link
link
link
link
link
link
link
link
meta
meta
meta
title
..........

script
script
script
script
script

Python Code Editor:

Have another way to solve this solution? Contribute your code (and comments) through Disqus.

Previous: Write a Python program to extract all the text from a given web page.
Next: Write a Python program to retrieve children of the html tag from a given web page.

What is the difficulty level of this exercise?

Test your Python skills with w3resource's quiz



Python: Tips of the Day

Getting the last element of a list:

some_list[-1] is the shortest and most Pythonic.

In fact, you can do much more with this syntax. The some_list[-n] syntax gets the nth-to-last element. So some_list[-1] gets the last element, some_list[-2] gets the second to last, etc, all the way down to some_list[-len(some_list)], which gives you the first element.

You can also set list elements in this way. For instance:

>>> some_list = [1, 2, 3]
>>> some_list[-1] = 5 # Set the last element
>>> some_list[-2] = 3 # Set the second to last element
>>> some_list
[1, 3, 5]

Note that getting a list item by index will raise an IndexError if the expected item doesn't exist. This means that some_list[-1] will raise an exception if some_list is empty, because an empty list can't have a last element.

Ref: https://bit.ly/3d8TfFP