Freelance Social

  • Cuboid is a mobile app that helps people find local and online part-time jobs in their areas of interest and earn money.
  • Cuboid helps people get work done: browse the app to hire the best-skilled person nearby, or post a job and receive requests from skilled workers around you.
  • Cuboid has two sides: one lets hirers post jobs on the Cuboid wall, and the other lets workers search for jobs in their areas of interest.
  • Cuboid makes money transfers easy with the Cube-Wallet, where people can send and receive money once the work is done.
  • People can spend the money in their Cube-Wallet to pay bills online, recharge, and pay at shops and restaurants.
  • They can also share money with their friends and family.

Google News Scraper

This Google News scraper scrapes news for a given keyword and writes the relevant results to a CSV file for download.

Install the necessary packages:

pip install requests
pip install beautifulsoup4
pip install pandas

Import the necessary libraries:

import requests
from bs4 import BeautifulSoup
import pandas as pd

Copy and paste the following code:

import requests
from bs4 import BeautifulSoup
import pandas as pd

# Search Query
query = 'Indian Stock Market'

# Encode special characters in a text string
def encode_special_characters(text):
    encoded_text = ''
    special_characters = {'&': '%26', '=': '%3D', '+': '%2B', ' ': '%20'}  # Add more special characters as needed
    for char in text.lower():
        encoded_text += special_characters.get(char, char)
    return encoded_text
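
# Example (note the function also lowercases its input):
# encode_special_characters('Indian Stock Market') -> 'indian%20stock%20market'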

query2 = encode_special_characters(query) # the query URL-encoded for the request
url = f"https://news.google.com/search?q={query2}&hl=en-US&gl=US&ceid=US%3Aen" # the complete search URL

response = requests.get(url) # fetch the HTML response
soup = BeautifulSoup(response.text, 'html.parser') # parse the HTML

articles = soup.find_all('article') # each news result is an <article> element
links = [article.find('a')['href'] for article in articles]
links = [link.replace("./articles/", "https://news.google.com/articles/") for link in links] # make the relative links absolute

news_text = [article.get_text(separator='\n') for article in articles]
news_text_split = [text.split('\n') for text in news_text] # split each article's text into its fields

news_df = pd.DataFrame({
    'Title': [text[2] if len(text) > 2 else 'Missing' for text in news_text_split],
    'Source': [text[0] for text in news_text_split],
    'Time': [text[3] if len(text) > 3 else 'Missing' for text in news_text_split],
    'Author': [text[4].split('By ')[-1] if len(text) > 4 else 'Missing' for text in news_text_split],
    'Link': links
}) # collect the parsed fields into a data frame

# Write to CSV
news_df.to_csv('news.csv', index=False)
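
To sanity-check the output, you can load the CSV back with pandas (a minimal sketch; news.csv is the file written above):

import pandas as pd

df = pd.read_csv('news.csv') # the file written by the scraper above
print(df.shape) # (number of articles, number of columns)
print(df.head()) # preview the first few rows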

Playstore Scraper

This Python code scrapes app data from the Play Store and saves it as a CSV file. It extracts details like titles, ratings, descriptions, and more for further analysis or comparison with competitors.

You can scrape and download data from your competitors, find out where they are lacking, and use it to your advantage.

How to use it?

  1. Open it in Google Colab
  2. Replace file_name with your actual file name
  3. Replace the drive location in df_busu.to_csv('/content/drive/My Drive/file_name.csv') with the path where you want to save your exported CSV file

Use Cases:

  1. To do sentiment analysis (a minimal sketch follows this list)
  2. Get a better understanding of in-depth positive and negative feedback
  3. Run it on a cron schedule and connect the output to analytics tools such as Power BI
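
For use case 1, here is a minimal sentiment sketch, assuming the CSV exported by the steps below and using TextBlob as one possible library (google-play-scraper review rows include a content column with the review text):

pip install textblob

from textblob import TextBlob
import pandas as pd

df = pd.read_csv('file_name.csv') # the CSV exported by the scraper below
# polarity ranges from -1 (negative) to 1 (positive)
df['polarity'] = df['content'].astype(str).apply(lambda t: TextBlob(t).sentiment.polarity)
print(df[['content', 'polarity']].head())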

Install and import the required libraries:

pip install google-play-scraper

from google_play_scraper import app
import pandas as pd
import numpy as np
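
The app import fetches the app-level details mentioned above (title, rating, description, and so on); a quick sketch, reusing the example package name from the next step:

# app() returns a dict of app-level details
details = app('app.rocket.com', lang='en', country='in')
print(details['title'], details['score'], details['installs'])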

Enter the app's package name from the Play Store URL, and specify the country, language, and sort method you want:

from google_play_scraper import Sort, reviews_all

us_reviews = reviews_all(
    'app.rocket.com', # package name of the app you want to scrape
    sleep_milliseconds=0, # defaults to 0
    lang='en', # defaults to 'en'
    country='in', # defaults to 'us'
    sort=Sort.NEWEST, # defaults to Sort.MOST_RELEVANT
)
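
reviews_all fetches every review, which can take a while for popular apps; if you only need a sample, the same library also provides reviews(), which accepts a count (a minimal sketch):

from google_play_scraper import Sort, reviews

# fetch only the newest 200 reviews; the second return value is a
# continuation token you can pass back in to page further
sample_reviews, _token = reviews(
    'app.rocket.com',
    lang='en',
    country='in',
    sort=Sort.NEWEST,
    count=200,
)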

Convert it into a data frame:

df_busu = pd.DataFrame(np.array(us_reviews), columns=['review']) # one 'review' column holding each review dict

df_busu = df_busu.join(pd.DataFrame(df_busu.pop('review').tolist())) # expand each dict into its own columns

# display the top reviews
df_busu.head()

Convert the data frame to CSV:

df_busu.to_csv('file_name.csv') # your filename here

from google.colab import drive
drive.mount('/content/drive') # mount Google Drive to store the file

df_busu.to_csv('/content/drive/My Drive/file_name.csv') # path to save your exported csv file

Check your Drive; you'll find the scraped review data in the specified .csv file.