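python - How to extract a title with a regular expression
Problem description
I want to extract the title "Delivery from £3.99 at Yours Clothing" from the HTML below. How should the regular expression be written? Any pointers from an expert would be much appreciated, thanks!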
<article data-offer-type='deal' class='offer-module js-offer-module list-module deal
merch-offer ' data-merchant='Yours Clothing' data-revision='2' data-variant='0' data-tab-group='online'><p class='offer-border'><a href='http://www.cgvv.com.cn/wenda/10332.html#' data-offerid='[4611854]'><i src-src='https://static-cdn.voucherco.co.uk/v10/images/_generated-sprites/offer-module-sprite@1x-cb-f3f7588d80d53be535315092f1d3d9ad.png' src-retina-src='https://static-cdn.vouchercodes.co.uk/v10/images/_generated-sprites/offer-module-sprite@2x-cb-659802c69b2fbdde72289424326e4eb4.png'></i> </a><p class='left-col'> <a href='http://www.cgvv.com.cn/out/offer/4611854/e7132f242406a3ef32a2b03703e9796951dff0cd/?ps=9&pageViewID=14903446907945303647157958d4daf24d1f5971796&wotst=ve0317_nove&mi=yoursclothing.co.uk&ppc=&tl=deal-offerimg&opi=mpx&inv=online&scc=0&sss=merchant&spn=%2Fyoursclothing.co.uk&spl=desktop&spv=14903439557945303648611658d4d813a3574281781&stv=ve0317_nove&sui=null&sli=0'><img src='https://static-cdn.voucherco.co.uk/v10/images/merchant/logo/128px/1142_140311175132.png' alt='Yours Clothing'/><strong class='offer-type label-deal'>deal</strong> </a></p><p class='offer-details'> <p class='header-wrapper'><h3 class='tp-offertitle js-offer-title'><a href='http://www.cgvv.com.cn/out/offer/4611854/e7132f242406a3ef32a2b03703e9796951dff0cd/?ps=9&pageViewID=14903446907945303647157958d4daf24d1f5971796&wotst=ve0317_nove&mi=yoursclothing.co.uk&ppc=&tl=deal-title&opi=mpx&inv=online&scc=0&sss=merchant&spn=%2Fyoursclothing.co.uk&spl=desktop&spv=14903439557945303648611658d4d813a3574281781&stv=ve0317_nove&sui=null&sli=0' class='js-click-reveal'> Delivery <strong>from £3.99</strong> at Yours Clothing</a> </h3>
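Answer
Answer 1:
>>> import re
>>> str_split = re.findall(r"js-click-reveal'>\n([\s\S]*?)<strong>([\s\S]*?)</strong>([\s\S]*?)\n", html)[0]
>>> print(str_split[0].lstrip() + str_split[1] + str_split[2])
Delivery from £3.99 at Yours Clothing
Here html holds the page source as a string. The pattern relies on the line breaks (\n) that surround the link text in the original page source, so it will not match if the markup has been collapsed onto a single line.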
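If the line breaks around the link text cannot be relied on, a variant that anchors on the closing </a> tag instead may be more forgiving. The following is only a sketch under assumptions: the helper name extract_title is hypothetical, the sample markup is a shortened stand-in for the snippet in the question (the href is abbreviated), and it assumes the title is always the text of the first link with class='js-click-reveal'.

```python
import re

# Shortened stand-in for the question's markup; the href is abbreviated
# for illustration and is not the real URL.
html = (
    "<h3 class='tp-offertitle js-offer-title'>"
    "<a href='#' class='js-click-reveal'> Delivery <strong>from £3.99</strong>"
    " at Yours Clothing</a> </h3>"
)


def extract_title(page):
    """Return the visible text of the first js-click-reveal link, or None."""
    # re.S lets '.' match newlines, so the pattern works whether or not
    # the markup contains line breaks.
    match = re.search(r"class='js-click-reveal'>(.*?)</a>", page, re.S)
    if match is None:
        return None
    # Drop nested tags such as <strong>, then collapse runs of whitespace.
    text = re.sub(r"<[^>]+>", "", match.group(1))
    return " ".join(text.split())


print(extract_title(html))  # Delivery from £3.99 at Yours Clothing
```

Stripping the inner tags in a second step avoids hard-coding the position of <strong>, so the same function should still work if the emphasized part of the title moves or is absent. For markup that varies more than this, an HTML parser is usually a safer choice than a regular expression.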
