scrapegraphai

View on PyPIReverse Dependencies (1)

1.21.0 scrapegraphai-1.21.0-py3-none-any.whl
1.20.1 scrapegraphai-1.20.1-py3-none-any.whl

Wheel Details

Project: scrapegraphai
Version: 1.20.1
Filename: scrapegraphai-1.20.1-py3-none-any.whl
Download: [link]
Size: 124155
MD5: ab9a341c46b0cbe2f269f8d2813eb95b
SHA256: 71de55b3292f893a09a41cfa66e6e9f5fe668ff90f81fe4d0732da4016018915
Uploaded: 2024-09-16 16:13:23 +0000

dist-info

METADATA

Metadata-Version: 2.3
Name: scrapegraphai
Version: 1.20.1
Summary: A web scraping library based on LangChain which uses LLM and direct graph logic to create scraping pipelines.
Author-Email: Marco Vinciguerra <mvincig11[at]gmail.com>, Marco Perini <perinim.98[at]gmail.com>, Lorenzo Padoan <lorenzo.padoan977[at]gmail.com>
Keywords: ai,artificial intelligence,gpt,graph,langchain,machine learning,natural language processing,nlp,openai,rag,scrapegraph,scrapegraphai,scraping,web scraping,web scraping library,web scraping tool,webscraping
Classifier: Intended Audience :: Developers
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: <4.0,>=3.9
Requires-Dist: beautifulsoup4 (>=4.12.3)
Requires-Dist: faiss-cpu (>=1.8.0)
Requires-Dist: free-proxy (>=1.1.1)
Requires-Dist: google (>=3.0.0)
Requires-Dist: html2text (>=2024.2.26)
Requires-Dist: langchain-aws (>=0.1.3)
Requires-Dist: langchain-community (>=0.2.9)
Requires-Dist: langchain-google-genai (>=1.0.7)
Requires-Dist: langchain-mistralai (>=0.1.12)
Requires-Dist: langchain-openai (>=0.1.22)
Requires-Dist: langchain (>=0.2.14)
Requires-Dist: minify-html (>=0.15.0)
Requires-Dist: pandas (>=2.2.2)
Requires-Dist: playwright (>=1.43.0)
Requires-Dist: python-dotenv (>=1.0.1)
Requires-Dist: semchunk (>=1.0.1)
Requires-Dist: tiktoken (>=0.7)
Requires-Dist: tqdm (>=4.66.4)
Requires-Dist: undetected-playwright (>=0.3.0)
Requires-Dist: burr[start] (==0.22.1); extra == "burr"
Requires-Dist: furo (==2024.5.6); extra == "docs"
Requires-Dist: sphinx (==6.0); extra == "docs"
Requires-Dist: browserbase (>=0.3.0); extra == "more-browser-options"
Requires-Dist: graphviz (>=0.20.3); extra == "more-semantic-options"
Requires-Dist: langchain-anthropic (>=0.1.11); extra == "other-language-models"
Requires-Dist: langchain-fireworks (>=0.1.3); extra == "other-language-models"
Requires-Dist: langchain-google-vertexai (>=1.0.7); extra == "other-language-models"
Requires-Dist: langchain-groq (>=0.1.3); extra == "other-language-models"
Requires-Dist: langchain-huggingface (>=0.0.3); extra == "other-language-models"
Requires-Dist: langchain-nvidia-ai-endpoints (>=0.1.6); extra == "other-language-models"
Provides-Extra: burr
Provides-Extra: docs
Provides-Extra: more-browser-options
Provides-Extra: more-semantic-options
Provides-Extra: other-language-models
Description-Content-Type: text/markdown
License-Expression: MIT
License-File: LICENSE
[Description omitted; length: 11374 characters]

WHEEL

Wheel-Version: 1.0
Generator: hatchling 1.25.0
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
scrapegraphai/__init__.py sha256=zEe125Am9yVBw7UyzFgt73IumtoeOnvtS1SYYldR-3U 54
scrapegraphai/builders/__init__.py sha256=eZeScT8SZ2i5QxmeAdK5oX8PGByMY-J-gH05R0h3mtY 90
scrapegraphai/builders/graph_builder.py sha256=1l5XTFjuINVha6n9NT2q-nx578Ohlvb5MC5iAqsQ2zw 6498
scrapegraphai/docloaders/__init__.py sha256=lTNLYtVFsgkfibrajMVl9sQG6jM0TgV8mBfds1iVbiM 128
scrapegraphai/docloaders/browser_base.py sha256=ecTfBg8qDzdGw08R9erNdH1x8ZJt43sBcYtsjYWJhrE 2708
scrapegraphai/docloaders/chromium.py sha256=2-QWFwHS2JgaDiQRloIkc__eGpG2Hneb7IW6_-piswQ 4851
scrapegraphai/graphs/__init__.py sha256=W3uL-XMwwSzQtoYnmGQN_yjVS5CxlaZfBqz6a3rF7e0 1217
scrapegraphai/graphs/abstract_graph.py sha256=TzxeZpYWToDUOeTFl_e7Q2pv5NcEwqaKkFXYMMvy4bU 7592
scrapegraphai/graphs/base_graph.py sha256=8dwFR_3Tsi_fDJY68moqX111olJRnlClA1Lx0B7xxHM 11612
scrapegraphai/graphs/csv_scraper_graph.py sha256=fmqzoareenCCg2MoOySIqrKFNBC-m8ibTTLqa7rX4Zg 3761
scrapegraphai/graphs/csv_scraper_multi_graph.py sha256=ev9l6Dgg7EPwMEguyje8pC-XYFCBqJALSYZJLTZAQkg 3691
scrapegraphai/graphs/deep_scraper_graph.py sha256=68lCeWL5Hl0hlv5r1sLXu5r6PauQABuD8SU1o1_k8xU 5426
scrapegraphai/graphs/json_scraper_graph.py sha256=m1BygF7CWTpsOUcvKflUWItj0GoMQ0vw6dwL3nXqXy0 3153
scrapegraphai/graphs/json_scraper_multi_graph.py sha256=Xp8NTXGQVH5xk5RS4ks50kSAM9E-6cDPZZyiWoQk9KU 3786
scrapegraphai/graphs/markdown_scraper_graph.py sha256=AVD6wR3T0zoju31HuJEgLdxd_gjN-lue2xrJgl6OMB0 3667
scrapegraphai/graphs/markdown_scraper_multi_graph.py sha256=xqux8TViUtyjd7oD3MUiYps2pvSdo4yWMMNdUln0r94 3518
scrapegraphai/graphs/omni_scraper_graph.py sha256=aNkDH31nlaBJTqERMc7luHSYwoJgw7rgre0Tl0m-psM 4310
scrapegraphai/graphs/omni_search_graph.py sha256=manoW789Fcou_5ecb6CTd-AFo3OZ1Q449tzG0usLYvk 4110
scrapegraphai/graphs/pdf_scraper_graph.py sha256=X--eRBzSo-gUnqsylrCCdopsHJha4KUXbBgw5IUHfcg 3589
scrapegraphai/graphs/pdf_scraper_multi_graph.py sha256=rx9dQvcFIqzpLx_-xP-5eZpU7JnUoY-k7lRZc2Jcjfo 3733
scrapegraphai/graphs/screenshot_scraper_graph.py sha256=fIjhQX57aK0XbjssUgrjOBPlJ4NuKena0UCfgEKgyks 2621
scrapegraphai/graphs/script_creator_graph.py sha256=wzD2ae0XUdm_W-bi5akTxDXILL-wkTprNhMG4bm-Gko 3929
scrapegraphai/graphs/script_creator_multi_graph.py sha256=_qwE39wV1zG6i9uw5d9mi4BQy5js9V4KgsFJ8kpLwDY 3826
scrapegraphai/graphs/search_graph.py sha256=2GL6oLq73vDSJEdvH6wL4D-rkAxdYJtsf_CIJoGrTsc 4399
scrapegraphai/graphs/search_link_graph.py sha256=Sv0y9ZejuQwd1-GpzrpB-6cgOOeFDZkmtkyjqZKcK6M 3817
scrapegraphai/graphs/smart_scraper_graph.py sha256=vHgi0qeVDrYHhGkUORZ70kBEYJ5ffZyh_nHDca5BsfE 3922
scrapegraphai/graphs/smart_scraper_multi_graph.py sha256=1IbAZ_V7Qf4EUitIJl9ywLZvLmx40DIHrAafFs01jlU 3810
scrapegraphai/graphs/speech_graph.py sha256=SZDIN9dzF2Tw4apPl44awqjokz3FICmJVIV_gDpE8EE 4346
scrapegraphai/graphs/xml_scraper_graph.py sha256=AfoW7_C6UKIqIOkC_dblG2wZh8JZk9WkJG-B3K1h11s 3284
scrapegraphai/graphs/xml_scraper_multi_graph.py sha256=qc0q1Pj4EbeglCd0pVguOSfm9X3qEsDj848oYf-GE2o 3740
scrapegraphai/helpers/__init__.py sha256=C58Mx0uwe_XZqvu11ugbg-UMobDa8gndf7Bp-vwy9-g 201
scrapegraphai/helpers/default_filters.py sha256=-BRpKoX-3vrbnm-MFXrYVl1R514n1h2gNQmSW0a5008 446
scrapegraphai/helpers/models_tokens.py sha256=lDuv6U4qLUWNJqkrN7eeXIsYjv3dDeNlQIhUmKyCsOg 8144
scrapegraphai/helpers/nodes_metadata.py sha256=6KUP0vs6zhCN720RNigWwBhBNiilFJy3yHhWUiCJWkI 3807
scrapegraphai/helpers/robots.py sha256=dlVh1F-T4pYCj06hDKk81TUY2jHLxpXrgs_e4uYKsOs 310
scrapegraphai/helpers/schemas.py sha256=0pwY9KPCgJ_9D4oLFV9wWspWg4Qet2rG3B-v70CSWAo 2363
scrapegraphai/integrations/__init__.py sha256=TKGD1AJTvsddtu3d73oT8v9s7byzXpHzT9L5dX-5YpA 118
scrapegraphai/integrations/burr_bridge.py sha256=GAcFQU4K3HjR-90-hk-cV1SCny-E8yCSR6GeDCBpucA 7581
scrapegraphai/integrations/indexify_node.py sha256=hlchEaBc_Dl04HLBHoAaOqI8yT4CrFmqk0Uobp2rdAQ 2530
scrapegraphai/models/__init__.py sha256=y0N450Ljn6dq72KqDpxiX2Mh-orX-hsIiH37WyYEcMk 190
scrapegraphai/models/deepseek.py sha256=PCUKq6c8RiG8mmvNfH7lybB6HWbiN6WuJoBzq8YhdRg 635
scrapegraphai/models/oneapi.py sha256=EPnSsUTfD4awRQhwCxqG3GXUh7D2-O-A7Oyc_Aps9RE 510
scrapegraphai/models/openai_itt.py sha256=TfZZKuU7wskOQ4YPr0NzClOImOLwPQnqqydwzgiScBU 1367
scrapegraphai/models/openai_tts.py sha256=NKl1j5IgvOnafuVnlZyDvIbIS8C9kzGfhf-gM55ck98 1294
scrapegraphai/nodes/__init__.py sha256=z2cH3WO85C5W1RlihhR9QHVUJFCMR0NiY3Z5k6peasA 1035
scrapegraphai/nodes/base_node.py sha256=QAk2SV-2VHJOqdriY8OwgGCoocai5IRXOQ83YxsVaBk 8880
scrapegraphai/nodes/conditional_node.py sha256=4w4wuj2G4gpL9jYGtGFq9oLVVCZcOIjWWEuvmuMsY80 1779
scrapegraphai/nodes/fetch_node.py sha256=svSw9xMZtEN3DynlL6zh924rog3nao-rme2LxzmImYU 12271
scrapegraphai/nodes/fetch_screen_node.py sha256=uTvNWMa4sclalF7FqBk--QQyZ6aki1XjUDyi4pCBvdE 1689
scrapegraphai/nodes/generate_answer_csv_node.py sha256=wXxlYv8L_OUZppd5R3j2pffaWhbmNm-Jl3wIZRsEGgA 6120
scrapegraphai/nodes/generate_answer_from_image_node.py sha256=jZSUOtcz7R5UzPUP8uwT9WgxEzGYV_qFXHFWOb9Igxc 3881
scrapegraphai/nodes/generate_answer_node.py sha256=7CJKXyez4KOJyskLtyZj6ipiHg8cFekOSlWy1CB2iLQ 7185
scrapegraphai/nodes/generate_answer_omni_node.py sha256=f0C47t8FlZG6DRZVr79Ay0f2FP3ewkqEYAHVJxwdFg0 5764
scrapegraphai/nodes/generate_answer_pdf_node.py sha256=05d0gFI9qCmjcXjQiDBzXkup5sH-sS8e_C8mbm8rPWs 6246
scrapegraphai/nodes/generate_scraper_node.py sha256=OzCYByuPQn29z_-dqwRONH1hX42ZwwfT5fwhNWHiblY 4791
scrapegraphai/nodes/get_probable_tags_node.py sha256=qgsJKr8_7XSww5kGFQdjOsznhBhq1LYjgrpPVhEbg1I 3520
scrapegraphai/nodes/graph_iterator_node.py sha256=11_fmoZrWXctDFi_om0lHyU-umGCRI7NGGzWhyVwG9g 4521
scrapegraphai/nodes/image_to_text_node.py sha256=bed_dujoaCkAPsrwZn2_2VrkiLZ5Ff6lct-UfE2Tqzs 2678
scrapegraphai/nodes/merge_answers_node.py sha256=D-YIrVBY-Ku4JajrxbVhM0utu3p_c2hThqeh2a7c4NI 3198
scrapegraphai/nodes/merge_generated_scripts.py sha256=zx2HvhdqJqr0zIChlA7fIXvttOpf5BYPao-WRzkc8Bg 4466
scrapegraphai/nodes/parse_node.py sha256=qVmthy_AIDLjXbH0i4b6Qu6WJdfHVq6AMiQZ1VJQU9A 3589
scrapegraphai/nodes/rag_node.py sha256=WUFQRNyGmDpDztsI4P5k5SVzjJL6bEemt7Ts5HF8n9Y 11610
scrapegraphai/nodes/robots_node.py sha256=GE-85GO0ev7ohh87DyrJhckePcRksSjddwigSIq6UGY 5179
scrapegraphai/nodes/search_internet_node.py sha256=5eEM_KxoBYK_T6w1VJsrgAegT02TaUdM_KFGDdqtMcY 3902
scrapegraphai/nodes/search_link_node.py sha256=WsY-8U6Oe-WQh3OzMo_2fce--M9VP8o9dvsq5z5oR_A 6236
scrapegraphai/nodes/search_node_with_context.py sha256=3RedfGLxrISCbmDoFW68yr1wjAfLKflxamidVFrVUOs 3829
scrapegraphai/nodes/text_to_speech_node.py sha256=1wuJ_ynCd2_EaWet12PmOM-j7fcl5_4XB03T6X9PT4w 2188
scrapegraphai/prompts/__init__.py sha256=IVrLdpw8-BeyawiSyP7F_GmhGErhFqNEsNDMade205Y 891
scrapegraphai/prompts/generate_answer_node_csv_prompts.py sha256=VTVJ0-G9R3WoaVvbSwt2KDZ-AkvDCg-gu4AwzIhbuF0 1816
scrapegraphai/prompts/generate_answer_node_omni_prompts.py sha256=TK1IPkA_bc2czViji_eJ_gSinHNWZY9vONdVZRw3FTY 2111
scrapegraphai/prompts/generate_answer_node_pdf_prompts.py sha256=p484EwVf2oSb8G2mjnb37dWER_O_vk8PpPwJjJgdRWk 1823
scrapegraphai/prompts/generate_answer_node_prompts.py sha256=MzkNgwADhODXTpgeUq09ewDzVh3ReVljXHHmN8G8awE 3684
scrapegraphai/prompts/merge_answer_node_prompts.py sha256=Vn7vvPDrGK_Je1ooCT8gN_tsrMGBwa1L6qFcrSel9X4 613
scrapegraphai/prompts/robots_node_prompts.py sha256=gug6WJ6Kl0AQg21il7lszmtj8LicdE7iuAKPIzAFftg 700
scrapegraphai/prompts/search_internet_node_prompts.py sha256=FLC-Gd4zYRIGAH8tN61vDvdG9l0eLurWJSVI1igh5w4 734
scrapegraphai/prompts/search_link_node_prompts.py sha256=qCbfE_C9cozIdAIVNhC26EXIb-MVSgHkzuHxAqgM0Vs 676
scrapegraphai/prompts/search_node_with_context_prompts.py sha256=8q1rwAKm2iNBm6l_OK1a2mq7FEpd3N8RrLqAL14MDhg 1024
scrapegraphai/telemetry/__init__.py sha256=yucN5mTsjXJjlbcZFPnmXifWe4DuqRbmP_MS3IsRq7c 154
scrapegraphai/telemetry/telemetry.py sha256=9vP7P-0QV2wCe5S_0qPeyQuMPjWayFBOl3jXtsBKRm4 6941
scrapegraphai/utils/__init__.py sha256=WpzzllViW5VrA-oaaACaEr9lfSVS63oBYfrw6r-U29o 488
scrapegraphai/utils/cleanup_html.py sha256=8xaxXffFWF3j1qHHzbriiYwCYLI3OE6nAlhpg8G1V3Q 1889
scrapegraphai/utils/convert_to_csv.py sha256=jkVXZt-oFBVjqcZv39_UaSZ-GnEvvuemgHWFlGBTDNw 1977
scrapegraphai/utils/convert_to_json.py sha256=5Zaov6WHZr8xbK0eiLTmfFRLd6UCYjJZj4OnmaNQbhc 1914
scrapegraphai/utils/convert_to_md.py sha256=kkN_PlvStYeLsKqhBLzJc-y5PN5lXawp4JdKkO-44cI 961
scrapegraphai/utils/copy.py sha256=ed0YX2-mwPtSUcbj9BNsJrO_M7N00JZ4RiobnOqxsXQ 2184
scrapegraphai/utils/logging.py sha256=uosOFmUz_WSxI-rf3O2LR5um6VCSnV6kj5LHcf_wU5g 6001
scrapegraphai/utils/parse_state_keys.py sha256=2XLExH1qeokvYmz39KAuMF2p9hsklxjGcQeYyE8j6ZU 3463
scrapegraphai/utils/prettify_exec_info.py sha256=UxtirABoh9XQeQ5lOHP6uyENGXooxzcuMKNa-lnoBBE 763
scrapegraphai/utils/proxy_rotation.py sha256=L_n0hSS8v4W8sKyOEmjka5okABCK3uE-6Qlv2Ce1c9U 6436
scrapegraphai/utils/research_web.py sha256=NCLwZ41KN4z8VuXaxje4j7nVeK_Tv7BhT-EVto_TQNc 2784
scrapegraphai/utils/save_audio_from_bytes.py sha256=n5S3Oxr_2jh2QzzIZhGvmgMXu1Ok0dQFsCK0PatsfnA 844
scrapegraphai/utils/sys_dynamic_import.py sha256=HHQBWiL7BgfNU_bQsxo5yLWrJdZqs__Dd-uB3rOJdx0 1620
scrapegraphai/utils/token_calculator.py sha256=Qi_FuG4svAXzCbi6CPqAwekfhb2_JE79MhZ3l-yRTkg 1362
scrapegraphai-1.20.1.dist-info/METADATA sha256=B0POcfjk1M64ZMvwKs_a6jud9-N5WNg9M1mWo38UzSs 13872
scrapegraphai-1.20.1.dist-info/WHEEL sha256=1yFddiXMmvYK7QYTqtRNtX66WJ0Mz8PYEiEUoOUUxRY 87
scrapegraphai-1.20.1.dist-info/licenses/LICENSE sha256=TfmkR6cAMxXPyFrgNWB0Pm3PdPMjRQwquIuzZqcF_D4 1065
scrapegraphai-1.20.1.dist-info/RECORD