| 
					
				 | 
			
			
				@@ -20,10 +20,16 @@ Quick sneak peak on the Entity dataset on Datastore: 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 --- 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 ## Installation 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+You can replicate this pipeline directly on your local machine or on the cloud shell on GCP. 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 **Requirements:** 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 - Clone this repo to your local machine using https://github.com/azizketari/covid19_ISMIR.git 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 - You need a Google Cloud project and IAM rights to create service accounts. 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				-- Enable APIs 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+- Set the project that you will be working on: 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+ 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+`gcloud config set project PROJECT_ID` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+ 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+- Enable APIs: 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 ``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 gcloud services enable vision.googleapis.com 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 gcloud services enable translate.googleapis.com 
			 | 
		
	
	
		
			
				| 
					
				 | 
			
			
				@@ -32,8 +38,11 @@ gcloud services enable bigquery.googleapis.com 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 ``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 - Install package requirements: 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				- 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				-```pip install -r requirements.txt``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+cd ~/covid19_ISMIR 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+pip3 install --user -r requirements.txt 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 Note: 
			 | 
		
	
	
		
			
				| 
					
				 | 
			
			
				@@ -44,21 +53,18 @@ will automatically download a model for you and install it. 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 ## Extracting data 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				-- **Step 0:** Navigate to the cloned repo on your local machine 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				-  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				-`cd ~/covid19_ISMIR` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 - **Step 1:** Modify the values to each variables in env_variables.sh file then run 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 > Assumption: You have already created/downloaded the json key to your Google Cloud Service Account. Useful [link](https://cloud.google.com/iam/docs/creating-managing-service-account-keys#iam-service-account-keys-create-python) 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 ``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				-./env_variables.sh 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+source env_variables.sh 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 ``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 - **Step 2:** Download the required files to your bucket and load the required model in your local   
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 (this step will take ~10 min) 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 > Optional: If you have already downloaded the scispacy model, you should modify the file ./content/download_content.sh to not repeat that step 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 ``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				-sh ~/content/download_content.sh 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				+source ./content/download_content.sh 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 pip install -U ./scispacy_models/en_core_sci_lg-0.2.4.tar.gz 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				 ``` 
			 | 
		
	
		
			
				 | 
				 | 
			
			
				  
			 |