PhD Seeker is a python web scraper to search for fully funded doctorate positions, advertised on well-known academic position websites.
Project description
PhD-Seeker ๐
PhD Seeker is a python web scraper to search for fully funded doctorate positions, advertised on well-known academic position websites.
If nowadays you are actively seeking a PhD position to pursue your studies, you must have realized that the process of searching for relevant vacancies is not straight forward. Visiting a large number of position advertising websites and encountering irrelevant commercials are two of the most common problems.
Simply modify the keywords and you will receive a CSV/XLSX file containing the last two pages from the most popular advertisers.
Sources ๐
Next Goals ๐ฏ
- Expanding the academic position advertising source
- Adding databases of different universities
- Finding and removing overlapped positions
- Adding LinkedIn search to get informed directly from university professors
- Getting the keywords from command line instead of hard-coding the source
- Fetching pages simultaneously
- Checking the availability of the new updates of
phdseeker
and notifying the user - GUI support
Installation
PyPi
phdseeker is available on PyPi:
$ pip install phdseeker
Git
You can get latest stable changes from github server:
$ git clone https://github.com/Aghababaei/PhD-Seeker.git
$ cd PhD-Seeker
$ python setup.py install
Zip, Tarball
You can download the latest tarball.
*nix
Get the latest tarball & install:
$ wget -O phdseeker-master.tar.gz https://github.com/Aghababaei/PhD-Seeker/archive/master.tar.gz
$ tar xvzf phdseeker-master.tar.gz && cd PhD-Seeker-main
$ python setup.py install
Windows
Downloading Archive
Download latest zip archive.
https://github.com/Aghababaei/PhD-Seeker/archive/master.zip
Decompress it, and run the following command in root directory of PhD-Seeker
$ python setup.py install
Adding Python to Windows environmental variables
Prior to running codes, make sure that Python has been already added to environmental variables as a PATH
, otherwise:
- Right-click This PC and going to Properties.
- Click on the Advanced system settings in the menu on the left.
- Click on the Environment Variables button oโn the bottom right.
- In the System variables section, select the Path variable and click on Edit. The next screen will show all the directories that are currently a part of the PATH variable.
- Click on New and entering Pythonโs install directory.
Requirements
phdseeker is relied on several great python packages. If you want to just run the code by calling the script, you need to install its dependencies, in advance.
pip install -r requirements.txt
Usage
phdseeker
Usage:
phdseeker -h
phdseeker -V
phdseeker --repolist
phdseeker [-k <keywords> --maxpage=<n> --output=<filetype(s)> -v]
options:
-h --help Show this screen.
-V --version Show version.
--repolist Show the list of repositories.
-v --verbose Show the found positions on the terminal.
-k <keywords>, --keywords=<keywords> Declare desired keywords to seek. [default: Computer Science, Machine Learning, Deep Learning]
-o <filetype(s)>, --output=<filetype(s)> Set the output type csv/xlsx/both [default: both]
--maxpage=<n> Maximum number of pages to fetch. [default: 10]
usage example
$ phdseeker -k 'Computer Science, Machine Learning' --maxpage=1 -v
Searching for the Keywords 'Computer Science, Machine Learning' in up to 1 page.
========================================::[ scholarshipdb ]::========================================
>> 704 positions found <<
===========================================::[ findaphd ]::==========================================
>> 72 positions found <<
>>>> 776 positions have been found in total.
Specifically, 21 records of them have been saved in the following files:
PhD_Positions_2022-08-08[Computer Science, Machine Learning].csv saved!
PhD_Positions_2022-08-08[Computer Science, Machine Learning].xlsx saved!
Elapsed time is 3.87
โโโโโโโโโโโโโโโโโโโโโณโโโโโโโโโโโโโโโโโโโโโณโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ Country โ Date โ Title โ
โกโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฉ
โ 1 Belgium โ โ Application of machine learning to screen hyperspectral โ
โ โ โ data for important soil and plant properties โ
โ 2 China โ โ PhD student (m/f/d) in the field of engineering, โ
โ โ โ computer science, technical software development, โ
โ โ โ mathematics, physics, data engineering or similar โ
โ 3 China โ โ SFI Centre for Research Training in Machine Learning โ
โ 4 Cyprus โ โ SFI Centre for Research Training in Machine Learning โ
โ 5 Denmark โ about 1 hour ago โ PhD Position in Computational Genetics and Machine โ
โ โ โ Learning: analysis of multi-omics biological data in โ
โ โ โ novel populations of Brachypodium โ
โ 6 Denmark โ about 18 hours ago โ PhD Stipend in Human-in-the-loop Data Mining and Deep โ
โ โ โ Learning on Graph Data (16-22068) โ
โ 7 Germany โ โ Discovery of new materials for applications on glass โ
โ โ โ using Deep Machine Learning and Data Analytics โ
โ 8 Germany โ โ Novel techniques for neuromorphic reservoir computing โ
โ 9 Netherlands โ 8 days ago โ PhD Candidate: Graph Neural Networks for Electricity and โ
โ โ โ Gas Networks โ
โ 10 Norway โ 3 months ago โ PhD Research Fellow in Informatics - Knowledge โ
โ โ โ Representation and Machine Learning โ
โ 11 Poland โ โ The Constitutive Law Establishment of Advanced High โ
โ โ โ Strength Steel based on Machine Learning โ
โ 12 Spain โ 14 days ago โ CALL 41-2022-1 Researcher in the Sustainable Artificial โ
โ โ โ Intelligence (SAI) research unit โ
โ 13 Suriname โ 8 days ago โ PhD "In silico prediction of antibiotic resistance" โ
โ โ โ (M/F) โ
โ 14 Sweden โ about 12 hours ago โ PhD student in Computational Science and Engineering โ
โ โ โ with focus on Optimization for Federated Machine โ
โ โ โ Learning โ
โ 15 Switzerland โ about 2 months ago โ PhD position in the field of Machine Learning โ
โ โ โ (Graph-based High-dimensional generative models) with โ
โ โ โ application to Medical Data Analysis at the Department โ
โ โ โ of Computer Science โ
โ 16 United Kingdom โ about 20 hours ago โ Research Assistant โ
โ 17 United Kingdom โ 3 days ago โ PhD Studentship: Implementation of Machine Learning at โ
โ โ โ the Edge โ
โ 18 United Kingdom โ โ Machine Learning Meets Sequential Monte Carlo Methods โ
โ 19 United Kingdom โ โ PhD Studentship in Computer Science โ
โ 20 United Kingdom โ โ SFI Centre for Research Training in Machine Learning โ
โ 21 United Kingdom โ โ Scholarship for the PhD in Medical Sciences in the โ
โ โ โ fields of Neuroscience and Biomedical Engineering for โ
โ โ โ the PhD Research Project โDevelopment of a closed-loop โ
โ โ โ controller for automatic administration of anaesthetic โ
โ โ โ and analgesic agents during surgery using machine โ
โ โ โ learning methodsโ โ
โโโโโโโโโโโโโโโโโโโโโดโโโโโโโโโโโโโโโโโโโโโดโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ Out of 776 found Ph.D. positions, 21 have been fetched in 1 page โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
Contributors
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file phdseeker-0.4.4.tar.gz
.
File metadata
- Download URL: phdseeker-0.4.4.tar.gz
- Upload date:
- Size: 25.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 46ad3c599e04993b32f75f56819ccf1b075e015065c871ed89ef54002e38f5a0 |
|
MD5 | ef226c7295c87cba3a398e8bfb7c340c |
|
BLAKE2b-256 | 667f157919c9a433baba822240d8c5f351261b91a936591f8cd9db75bbc2d5bf |