from Bimo@lemmy.world to python@programming.dev on 17 Aug 2024 18:49
https://lemmy.world/post/18764967
<img alt="" src="https://lemmy.world/pictrs/image/abd3b43a-c119-427a-a4a0-c27c0e16f96b.png">
<img alt="" src="https://lemmy.world/pictrs/image/ec9b583b-cacf-4ec1-96ad-10b993a63439.jpeg">
First thing first, while I started to build this package I’ve made an error with the word <scraper> that I’ve misspelled in <scrapper> I’m not a native english speaker. I’m planning to change the name and correct it. So don’t be mad with me about it. Ok now let me introduce my first python package.
aba-cli-scrapper
** i’ts a cli tool to easily build a dataset from Alibaba.
look at the repo to know more about this project : github.com/poneoneo/Alibaba-CLI-Scraper
I’m excited to share the latest release of my first Python package, aba-cli-scrapper designed to facilitate data extraction from Alibaba. This command-line tool enables users to build a comprehensive dataset containing valuable information on products and suppliers associated . The extracted data can be stored in either a MySQL or SQLite database, with the option to convert it into CSV files from the SQLite file.
The latest feature will be an ai agent to chat with about data saved. I’m working on this. Its should be release very a soon.
Key Features:
–Asynchronous mode for faster scraping of page results using Bright-Data API key (configuration required)
–Synchronous mode available for users without an API key (note: proxy limitations may apply)
–Supports data storage in MySQL or SQLite databases
–Text mode for thoses who are not comfortable with cli app
–Converts data to CSV files from SQLite database
Seeking Feedback and Contributions:
I’d love to hear your thoughts on this project and encourage you to test it out. Your feedback and suggestions on the package’s usefulness and potential evolution are invaluable. Future plans include adding a RAG feature to enhance database interactions.
Feel free to try out aba-cli-scrapper and share your experiences! Leave a start if you liked .
#python
threaded - newest
Seems well organized. Nice interface too.
Oh thank you guys did you test it.
I haven’t tested it.