Python BeautifulSoup: Extract a tag or string from a given tree of html document

Last update on December 21 2024 07:24:36 (UTC/GMT +8 hours)

Write a Python program to extract a tag or string from a given tree of html document.

Sample Solution:

Python Code:

from bs4 import BeautifulSoup
html_content = '<a href="https://w3resource.com/">Python exercises<i>w3resource</i></a>'
soup = BeautifulSoup(html_content, "lxml")
print("Original Markup:")
print(soup.a)
i_tag = soup.i.extract()
print("\nExtract i tag from said html Markup:")
print(i_tag)

Sample Output:

Original Markup:
<a href="https://w3resource.com/">Python exercises<i>w3resource</i></a>

Extract i tag from said html Markup:
<i>w3resource</i>

Python Code Editor:

Have another way to solve this solution? Contribute your code (and comments) through Disqus.

Previous: Write a Python program to remove the contents of a tag in a given html document.
Next: Write a Python program to remove a tag from a given tree of html document and destroy it and its contents..