关于python：如何从＆lt; a＆gt;获取网址和标题

How to get the url and the title from the <a> tags with beautifulSoup

我正在编写一个脚本，用class="pntc txt"从div获取所有链接，在我想从获取链接后，标记href属性和Something之间的文本。for after获取该URL和文本并将它们插入数据库中。我将发布到目前为止所做的代码：

1
2
3
4
5
6
7
8
9
10
11
12
13

import urllib.request
from bs4 import *

sock = urllib.request.urlopen("http://as.com/tag/moto_gp/a/")
htmlSource = sock.read()
sock.close()

soup = BeautifulSoup(htmlSource)

for div in soup.findAll('div', {'class': 'pntc-txt'}):
a = div.findAll('a')
print (a)