I am using the MediaWiki API to retrieve all the URLs linked from a Wikipedia article, 'https://en.wikipedia.org/w/api.php?action=query&prop=links&redirects&pllimit=500&format=json', but it only returns a list of link titles. For example, the Artificial Intelligence Wikipedia page has a link titled "delivery networks", but the actual URL is "https://en.wikipedia.org/wiki/Content_delivery_network", which is what I want.
2 Answers
I have replaced most of my previous answer, including the code, to use the information provided in Tgr’s answer, in case someone else would like sample Python code. This code is heavily based on code from MediaWiki for so-called ‘raw continuations’.
I have deliberately limited the number of links requested per invocation to five so that one more parameter option could be demonstrated.
I mentioned in my first answer that, if the OP wanted to do something similar with artificial intelligence, then he should begin with ‘Artificial intelligence’, noting the capitalisation. Otherwise the search would start with a disambiguation page, with all of the complications that could arise from those.
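The code this answer refers to is not reproduced on this page, so what follows is only a minimal sketch of the idea, not the original. It assumes the generator query from Tgr's answer and follows the API's default `continue` mechanism rather than raw continuation. The names `fetch_json` and `iter_link_urls` and the User-Agent string are made up for illustration, and the batch size of five mirrors the limit described above.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://en.wikipedia.org/w/api.php"


def fetch_json(params):
    """Send one GET request to the API and decode the JSON reply."""
    url = API_URL + "?" + urllib.parse.urlencode(params)
    # The User-Agent value is a placeholder; replace it with your own.
    request = urllib.request.Request(
        url, headers={"User-Agent": "link-url-sketch/0.1 (example)"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def iter_link_urls(title, limit=5, fetch=fetch_json):
    """Yield (title, full URL) for links on a page, following continuation."""
    params = {
        "action": "query",
        "format": "json",
        "titles": title,
        "redirects": "",       # boolean flag: present means "follow redirects"
        "generator": "links",
        "gpllimit": limit,     # small batch so continuation is exercised
        "prop": "info",
        "inprop": "url",
    }
    continuation = {}
    while True:
        # Merge the continuation values from the previous reply, if any.
        data = fetch({**params, **continuation})
        for page in data.get("query", {}).get("pages", {}).values():
            if "fullurl" in page:
                yield page["title"], page["fullurl"]
        if "continue" not in data:
            break
        continuation = data["continue"]
```

Calling `for title, url in iter_link_urls("Artificial intelligence"): print(title, url)` should print one pair per linked page. The `fetch` argument exists only so the loop can be exercised without network access. A page could in principle appear in more than one batch, so deduplicate if that matters.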
Use a generator:
action=query&
format=jsonfm&
titles=Estelle_Morris&
redirects&
generator=links&
gpllimit=500&
prop=info&
inprop=url
See the API docs on generators and the info module.
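As a rough illustration of how the parameters above might be used from Python, here is a sketch that builds the request URL and pulls the full URLs out of a decoded reply. The helper names `build_query_url` and `extract_urls` are made up for this example, and it uses `format=json` instead of `jsonfm`, since `jsonfm` returns a pretty-printed HTML page meant for viewing in a browser rather than for parsing.

```python
from urllib.parse import urlencode

API_URL = "https://en.wikipedia.org/w/api.php"


def build_query_url(title, limit=500):
    """Build the generator=links query URL for one page title."""
    params = [
        ("action", "query"),
        ("format", "json"),      # machine-readable variant of jsonfm
        ("titles", title),
        ("redirects", ""),       # boolean flag, sent with an empty value
        ("generator", "links"),
        ("gpllimit", str(limit)),
        ("prop", "info"),
        ("inprop", "url"),
    ]
    return API_URL + "?" + urlencode(params)


def extract_urls(response):
    """Map each linked page title to its full URL from a decoded reply."""
    pages = response.get("query", {}).get("pages", {})
    return {
        page["title"]: page["fullurl"]
        for page in pages.values()
        if "fullurl" in page
    }
```

With `inprop=url`, each entry under `query.pages` carries a `fullurl` field, which is the URL the question asks for. A single request returns at most `gpllimit` pages, so articles with more links need follow-up requests using the `continue` values in the reply.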