From 243ef2818912d10090b72f8445e88a94643456c5 Mon Sep 17 00:00:00 2001
From: Sebi
Date: Fri, 5 Jun 2026 11:40:33 +0300
Subject: [PATCH 1/4] Add job-seeker-ro-spider topic page

---
 topics/job-seeker-ro-spider/index.md | 14 ++++++++++++++
 1 file changed, 14 insertions(+)
 create mode 100644 topics/job-seeker-ro-spider/index.md

diff --git a/topics/job-seeker-ro-spider/index.md b/topics/job-seeker-ro-spider/index.md
new file mode 100644
index 00000000000..2d8ce8cd00c
--- /dev/null
+++ b/topics/job-seeker-ro-spider/index.md
@@ -0,0 +1,14 @@
+---
+display_name: Peviitor Job Spider
+short_description: Web scrapers that collect job titles and application links from Romanian company career pages and push them to peviitor.ro.
+topic: job-seeker-ro-spider
+related: web-scraping, data-extraction, automation, github-actions, open-source, etl
+---
+
+**Peviitor Job Spider** is a collection of web scrapers that extract job titles and application links from Romanian company career pages and index them in [peviitor.ro](https://peviitor.ro) — a Romanian job search engine.
+
+Each scraper targets one company, validates legal identity through the ANAF public API, extracts job listings from HTML or JSON sources, and pushes structured data to Apache Solr. The ecosystem includes Node.js (ESM, cheerio) and Python implementations, all deployed via GitHub Actions on a cron schedule.
+
+Part of the peviitor.ro open-source community — a crowd-sourced job aggregator that keeps Romania's tech job market up to date.
+
+Want to contribute? Whether you prefer Java, JavaScript, Rust, Go, Nim, R, or any other language — you're welcome to write your own scraper and help grow peviitor.ro!

From f19236d3ba9a380922868b2680add2997f0f084c Mon Sep 17 00:00:00 2001
From: Sebi
Date: Fri, 5 Jun 2026 11:41:55 +0300
Subject: [PATCH 2/4] Add peviitor.ro url to frontmatter

---
 topics/job-seeker-ro-spider/index.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/topics/job-seeker-ro-spider/index.md b/topics/job-seeker-ro-spider/index.md
index 2d8ce8cd00c..2752cbd3a64 100644
--- a/topics/job-seeker-ro-spider/index.md
+++ b/topics/job-seeker-ro-spider/index.md
@@ -3,6 +3,7 @@ display_name: Peviitor Job Spider
 short_description: Web scrapers that collect job titles and application links from Romanian company career pages and push them to peviitor.ro.
 topic: job-seeker-ro-spider
 related: web-scraping, data-extraction, automation, github-actions, open-source, etl
+url: https://peviitor.ro
 ---

 **Peviitor Job Spider** is a collection of web scrapers that extract job titles and application links from Romanian company career pages and index them in [peviitor.ro](https://peviitor.ro) — a Romanian job search engine.

From 696732ffd4fcbde0614dc75f599f97e8cb82c640 Mon Sep 17 00:00:00 2001
From: Sebi
Date: Fri, 5 Jun 2026 11:43:22 +0300
Subject: [PATCH 3/4] Update display_name to job-seeker-ro-spider

---
 topics/job-seeker-ro-spider/index.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/topics/job-seeker-ro-spider/index.md b/topics/job-seeker-ro-spider/index.md
index 2752cbd3a64..f6326bf6de9 100644
--- a/topics/job-seeker-ro-spider/index.md
+++ b/topics/job-seeker-ro-spider/index.md
@@ -1,12 +1,12 @@
 ---
-display_name: Peviitor Job Spider
+display_name: job-seeker-ro-spider
 short_description: Web scrapers that collect job titles and application links from Romanian company career pages and push them to peviitor.ro.
 topic: job-seeker-ro-spider
 related: web-scraping, data-extraction, automation, github-actions, open-source, etl
 url: https://peviitor.ro
 ---

-**Peviitor Job Spider** is a collection of web scrapers that extract job titles and application links from Romanian company career pages and index them in [peviitor.ro](https://peviitor.ro) — a Romanian job search engine.
+**job-seeker-ro-spider** is a collection of web scrapers that extract job titles and application links from Romanian company career pages and index them in [peviitor.ro](https://peviitor.ro) — a Romanian job search engine.

 Each scraper targets one company, validates legal identity through the ANAF public API, extracts job listings from HTML or JSON sources, and pushes structured data to Apache Solr. The ecosystem includes Node.js (ESM, cheerio) and Python implementations, all deployed via GitHub Actions on a cron schedule.

From c94e42d97f7fad7aa22d68a89484e80c4ce18128 Mon Sep 17 00:00:00 2001
From: Sebi
Date: Fri, 5 Jun 2026 11:46:57 +0300
Subject: [PATCH 4/4] Remove open-source from description per test requirement

---
 topics/job-seeker-ro-spider/index.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/topics/job-seeker-ro-spider/index.md b/topics/job-seeker-ro-spider/index.md
index f6326bf6de9..ac934521685 100644
--- a/topics/job-seeker-ro-spider/index.md
+++ b/topics/job-seeker-ro-spider/index.md
@@ -10,6 +10,6 @@ url: https://peviitor.ro

 Each scraper targets one company, validates legal identity through the ANAF public API, extracts job listings from HTML or JSON sources, and pushes structured data to Apache Solr. The ecosystem includes Node.js (ESM, cheerio) and Python implementations, all deployed via GitHub Actions on a cron schedule.

-Part of the peviitor.ro open-source community — a crowd-sourced job aggregator that keeps Romania's tech job market up to date.
+Part of the peviitor.ro community — a crowd-sourced job aggregator that keeps Romania's tech job market up to date.

Want to contribute? Whether you prefer Java, JavaScript, Rust, Go, Nim, R, or any other language — you're welcome to write your own scraper and help grow peviitor.ro!