104 lines
7.0 KiB
Plaintext
104 lines
7.0 KiB
Plaintext
{
|
||
"cells": [
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 6,
|
||
"metadata": {},
|
||
"outputs": [
|
||
{
|
||
"name": "stdout",
|
||
"output_type": "stream",
|
||
"text": [
|
||
"title= 青岛:迎春花灯美轮美奂 市民观赏其乐陶陶\n",
|
||
"author= []\n",
|
||
"publish_date= None\n",
|
||
"summary= \n",
|
||
"text= 2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"2021年2月15日晚,山东青岛西海岸新区金沙滩啤酒城花灯璀璨,赏心悦目。当日,各式花灯流光溢彩,市民和游客被美轮美奂的多种牛的造型花灯和十二生肖花灯吸引,其乐陶陶,共享新春灯会带来的欢乐和艺术盛宴。\n",
|
||
"\n",
|
||
"责编:吕原 \n",
|
||
"\n",
|
||
"images= {'https://appimg.dzwww.com/2021/0216/602af8081c74c.jpg', 'https://appimg.dzwww.com/2021/0216/602af808209c1.jpg', 'http://hb.dzwww.com/images/hb.ico', 'http://hb.dzwww.com/images/pc/top-logo-hb.png?v=1', 'https://appapi.dzwww.com/images/icon/cities/.png', 'https://appimg.dzwww.com/2021/0216/602af80861c94.jpg', 'https://appimg.dzwww.com/2021/0216/602af8086df8a.jpg', 'https://appimg.dzwww.com/2021/0216/602af8082499d.jpg', 'http://hb.dzwww.com/images/pc/dyh.jpg?v=1', 'https://appimg.dzwww.com/share/2021/02/16/a7c665f6c127cdb.jpg', 'https://appimg.dzwww.com/2021/0216/602af80865b93.jpg', 'http://hb.dzwww.com/images/pc/code-btn2.gif', 'https://appimg.dzwww.com/2021/0216/602af8085dbd4.jpg', 'http://hb.dzwww.com/images/pc/code-btn1.gif', 'https://appimg.dzwww.com/2021/0216/602af8085ac7d.jpg', 'https://appimg.dzwww.com/2021/0216/602af8081e9a8.jpg', 'http://appimg.dzwww.com/dzcloud/20200519/6d4cb0f483c28be39a7acc4db0e5c209.jpg', 'https://appimg.dzwww.com/2021/0216/602af80828c60.jpg', 'https://uct01.dzwww.com/images/noavatar_middle.gif', 'https://appimg.dzwww.com/2021/0216/602af80822e80.jpg', 'http://hb.dzwww.com/images/pc/code.gif?v=1', 'https://appimg.dzwww.com/2021/0216/602af80872e82.jpg', 'http://hb.dzwww.com/images/pc/kfzsq.gif?v=1', 'https://appimg.dzwww.com/share/2021/02/17/cdfe65611cdecfc.jpg', 'http://hb.dzwww.com/images/pc/noavatar_middle.jpg'}\n"
|
||
]
|
||
}
|
||
],
|
||
"source": [
|
||
"import time\n",
|
||
"from newspaper import Article\n",
|
||
"\n",
|
||
"url = 'http://hb.dzwww.com/p/7882663.html'\n",
|
||
"# 创建文章对象\n",
|
||
"news = Article(url, language='zh')\n",
|
||
"# 下载网页\n",
|
||
"news.download()\n",
|
||
"## 网页解析\n",
|
||
"news.parse()\n",
|
||
"print(\"title=\",news.title)# 获取文章标题\n",
|
||
"print(\"author=\", news.authors) # 获取文章作者\n",
|
||
"print(\"publish_date=\", news.publish_date) # 获取文章日期\n",
|
||
"# 自然语言处理\n",
|
||
"#news.nlp()\n",
|
||
"#print('keywords=',news.keywords)#从文本中提取关键字\n",
|
||
"print(\"summary=\",news.summary)# 获取文章摘要\n",
|
||
"# time.sleep(30)\n",
|
||
"print(\"text=\",news.text,\"\\n\")# 获取文章正文\n",
|
||
"\n",
|
||
"#print(\"movies=\",news.movies) # 获取文章视频链接\n",
|
||
"#print(\"top_iamge=\",news.top_image) # 获取文章顶部图片地址\n",
|
||
"print(\"images=\",news.images)#从html中提取所有图像\n",
|
||
"#print(\"imgs=\",news.imgs)\n",
|
||
"#print(\"html=\",news.html)#获取html"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": null,
|
||
"metadata": {},
|
||
"outputs": [],
|
||
"source": []
|
||
}
|
||
],
|
||
"metadata": {
|
||
"kernelspec": {
|
||
"display_name": "Python [conda env:root] *",
|
||
"language": "python",
|
||
"name": "conda-root-py"
|
||
},
|
||
"language_info": {
|
||
"codemirror_mode": {
|
||
"name": "ipython",
|
||
"version": 3
|
||
},
|
||
"file_extension": ".py",
|
||
"mimetype": "text/x-python",
|
||
"name": "python",
|
||
"nbconvert_exporter": "python",
|
||
"pygments_lexer": "ipython3",
|
||
"version": "3.7.4"
|
||
}
|
||
},
|
||
"nbformat": 4,
|
||
"nbformat_minor": 2
|
||
}
|