How can I get image title using Wikipedia API

Question

Suppose, that I got URLs of images on the page,

 for i in wiki.images:
    print (i)

Is there easy way to get images titles?

what exactly are you getting in i can you give us a hint plz — Chetan Vashisth, May 21 '19 at 04:17

score 3 · Accepted Answer · answered May 21 '19 at 04:22

3

try:

If you are looping through all the urls of the images then you can try

for i in wiki.images:
    i.split('/')[-1]  # -1 because the name is at the last part of the url

So the above code will give you the image name.

Hope this helps...

answered May 21 '19 at 04:22

Chetan Vashisth

438
4
14

score 2 · Answer 2 · answered May 21 '19 at 04:32

If what you are trying to get are the image tag's title attribute (i.e., from the HTML), you could do something akin to:

import wikipedia
from html.parser import HTMLParser

class WikipediaImageParser(HTMLParser):
    def handle_starttag(self, tag, attrs):
        if tag == 'img':
            try:
                print(dict(attrs)['title'])
            except KeyError as e:
                return # do nothing

page = wikipedia.page("History_of_Japan")
parser = WikipediaImageParser()
parser.feed(page.html())

You can parse the HTML to get a dict of the attributes for each image and then just check if there is a title attribute.

How can I get image title using Wikipedia API

2 Answers2