Advertisement
furas

Python - Pill bug

Dec 11th, 2016
208
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
Python 4.55 KB | None | 0 0
  1.  
  2. # --- SCRIPT ---
  3.  
  4. html = '''<table id="Infobox-bug" align="right" style="background: #adff2f; margin-left: 10px; margin-bottom: 10px; border-radius: 10px; -moz-border-radius: 10px; -webkit-border-radius: 10px; -khtml-border-radius: 10px; -icab-border-radius: 10px; -o-border-radius: 10px; border: 3px solid #9acd32; width: 25%">
  5.  
  6. <tr align="center">
  7. <td colspan="2"> <big><big><b>Pill Bug</b></big></big>
  8. </td></tr>
  9. <tr align="center">
  10. <td style="background: #caecc9; border-radius: 10px; -moz-border-radius: 10px; -webkit-border-radius: 10px; -khtml-border-radius: 10px; -icab-border-radius: 10px; -o-border-radius: 10px;" colspan="2"> <a href="/wiki/File:Pill_Bug_Picture.jpg" class="image"><img alt="Pill Bug Picture.jpg" src="/w/images/b/bb/Pill_Bug_Picture.jpg" width="199" height="186" /></a>
  11. </td></tr>
  12. <tr>
  13. <th style="background: #86df2d; border-top-left-radius: 10px; -moz-border-radius-topleft: 10px; -webkit-border-top-left-radius: 10px; -khtml-border-top-left-radius: 10px; -icab-border-top-left-radius: 10px; -o-border-top-left-radius: 10px;" align="right"> Scientific name
  14. </th>
  15. <td style="background:#ffffff; border-top-right-radius: 10px; -moz-border-radius-topright: 10px; -webkit-border-top-right-radius: 10px; -khtml-border-top-right-radius: 10px; -icab-border-top-right-radius: 10px; -o-border-top-right-radius: 10px;" align="left"> <i>Armadillidium vulgare</i>
  16. </td></tr>
  17. <tr>
  18. <th style="background: #86df2d" align="right"> Family
  19. </th>
  20. <td style="background:#ffffff" align="left"> <i>Armadillidiidae - Terrestrial Custaceans</i>
  21. </td></tr>
  22. <tr>
  23. <th style="background: #86df2d" align="right"> Time of year
  24. </th>
  25. <td style="background:#ffffff" align="left"> All year
  26. </td></tr>
  27. <tr>
  28. <th style="background: #86df2d" align="right"> Time of day
  29. </th>
  30. <td style="background:#ffffff" align="left"> All day
  31. </td></tr>
  32. <tr>
  33. <th style="background: #86df2d" align="right"> Location
  34. </th>
  35. <td style="background:#ffffff" align="left"> Under rocks
  36. </td></tr>
  37. <tr>
  38. <th style="background: #86df2d" align="right"> Size
  39. </th>
  40. <td style="background:#ffffff" align="left"> 2 mm
  41. </td></tr>
  42. <tr>
  43. <th style="background: #86df2d" align="right"> Rarity
  44. </th>
  45. <td style="background:#ffffff" align="left"> Common
  46. </td></tr>
  47. <tr>
  48. <th style="background: #86df2d" align="right"> Selling price
  49. </th>
  50. <td style="background:#ffffff" align="left"> 250 <a href="/wiki/Bells" title="Bells">Bells</a>
  51. </td></tr>
  52. <tr>
  53. <th style="background: #86df2d; border-bottom-left-radius: 10px; -moz-border-radius-bottomleft: 10px; -webkit-border-bottom-left-radius: 10px; -khtml-border-bottom-left-radius: 10px; -icab-border-bottom-left-radius: 10px; -o-border-bottom-left-radius: 10px;" align="right"> Appearances
  54. </th>
  55. <td style="background:#ffffff; border-bottom-right-radius: 10px; -moz-border-radius-bottomright: 10px; -webkit-border-bottom-right-radius: 10px; -khtml-border-bottom-right-radius: 10px; -icab-border-bottom-right-radius: 10px; -o-border-bottom-right-radius: 10px;" align="left"> <i><a href="/wiki/Doubutsu_no_Mori_(game)" title="Doubutsu no Mori (game)">Doubutsu no Mori</a></i>,<br /><i><a href="/wiki/Animal_Crossing_(GCN)" title="Animal Crossing (GCN)">Animal Crossing</a></i>,<br /><i><a href="/wiki/Animal_Crossing:_Wild_World" title="Animal Crossing: Wild World">Animal Crossing: Wild World</a></i>,<br /><i><a href="/wiki/Animal_Crossing:_City_Folk" title="Animal Crossing: City Folk">Animal Crossing: City Folk</a></i>,<br /><i><a href="/wiki/Animal_Crossing:_New_Leaf" title="Animal Crossing: New Leaf">Animal Crossing: New Leaf</a></i>
  56. </td></tr></table>'''
  57.  
  58. from bs4 import BeautifulSoup
  59.  
  60. #import requests
  61. #r = requests.get('https://nookipedia.com/wiki/Pill_Bug')
  62. #html = r.content
  63.  
  64. soup = BeautifulSoup(html, "html.parser")
  65. tds = soup.find(id="Infobox-bug").find_all('td')
  66.  
  67. for x in tds:
  68.     print('>', x.get_text().strip())
  69.  
  70. print(tds[0])
  71. print(tds[1])
  72.  
  73. # --- RESULT ---
  74.  
  75. > Pill Bug
  76. >
  77. > Armadillidium vulgare
  78. > Armadillidiidae - Terrestrial Custaceans
  79. > All year
  80. > All day
  81. > Under rocks
  82. > 2 mm
  83. > Common
  84. > 250 Bells
  85. > Doubutsu no Mori,Animal Crossing,Animal Crossing: Wild World,Animal Crossing: City Folk,Animal Crossing: New Leaf
  86. <td colspan="2"> <big><big><b>Pill Bug</b></big></big>
  87. </td>
  88. <td colspan="2" style="background: #caecc9; border-radius: 10px; -moz-border-radius: 10px; -webkit-border-radius: 10px; -khtml-border-radius: 10px; -icab-border-radius: 10px; -o-border-radius: 10px;"> <a class="image" href="/wiki/File:Pill_Bug_Picture.jpg"><img alt="Pill Bug Picture.jpg" height="186" src="/w/images/b/bb/Pill_Bug_Picture.jpg" width="199"/></a>
  89. </td>
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement