15 changes: 15 additions & 0 deletions topics/job-seeker-ro-spider/index.md
@@ -0,0 +1,15 @@
---
display_name: job-seeker-ro-spider
short_description: Web scrapers that collect job titles and application links from Romanian company career pages and push them to peviitor.ro.
topic: job-seeker-ro-spider
related: web-scraping, data-extraction, automation, github-actions, open-source, etl
url: https://peviitor.ro
---

**job-seeker-ro-spider** is a collection of web scrapers that extract job titles and application links from Romanian company career pages and index them in [peviitor.ro](https://peviitor.ro), a Romanian job search engine.

Each scraper targets a single company: it validates the company's legal identity through the ANAF public API, extracts job listings from HTML or JSON sources, and pushes the structured data to Apache Solr. The ecosystem includes Node.js (ESM, cheerio) and Python implementations, all run on a cron schedule via GitHub Actions.
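
The extraction step might look roughly like the following minimal sketch. It is not taken from any existing scraper: the `job` CSS class, the `extract_jobs` helper, and the `title`/`url`/`company` field names are assumptions chosen for illustration, and the ANAF validation and Solr push are left out. It uses only the Python standard library.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class JobLinkParser(HTMLParser):
    """Collects (title, href) pairs from <a> tags carrying a hypothetical "job" class."""

    def __init__(self):
        super().__init__()
        self.jobs = []
        self._href = None   # href of the job link currently being read, if any
        self._text = []     # text fragments seen inside that link

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "a" and "job" in classes and attrs.get("href"):
            self._href = attrs["href"]
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace so titles are stored in a clean form.
            title = " ".join("".join(self._text).split())
            if title:
                self.jobs.append((title, self._href))
            self._href = None


def extract_jobs(html, base_url, company):
    """Return one dict per job link, with relative links resolved against base_url."""
    parser = JobLinkParser()
    parser.feed(html)
    parser.close()
    return [
        {"title": title, "url": urljoin(base_url, href), "company": company}
        for title, href in parser.jobs
    ]
```

Calling `extract_jobs(page_html, "https://example.com", "Example SRL")` returns a list of dictionaries that a later step could serialize and send to the index. Real career pages vary widely, so the selector logic is the part each scraper would have to adapt.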

The project is part of the peviitor.ro community, a crowd-sourced job aggregator that keeps listings for Romania's tech job market up to date.

Want to contribute? Whether you prefer Java, JavaScript, Rust, Go, Nim, R, or any other language, you're welcome to write your own scraper and help grow peviitor.ro!