Data Scientology

Channel address: @datascientology
Categories: Uncategorized
Language: English
Subscribers: 1.26K
Description from channel

Hot data science related posts every hour. Chat: https://telegram.me/r_channels
Contacts: @lgyanf

Ratings & Reviews

1.67 (3 reviews)

5 stars: 0
4 stars: 0
3 stars: 0
2 stars: 2
1 star: 1

The latest Messages 11

2022-05-31 06:14:56
52 views, 03:14
2022-05-31 06:14:37 Nugget Ice VS Bullet Ice

https://redd.it/v1f3ql
@datascientology
50 views, 03:14
2022-05-31 05:14:52 Free zip code database with Census data

A couple of weeks ago, I shared my site, EverythingByZipCode.com, a zip code database nearly 900 columns wide, compiled from multiple public government sites. I posted it to get feedback on the database and the general concept.

Link of the original post:

https://www.reddit.com/r/datasets/comments/uh6g2b/free_zip_code_database_800_columns/

Based on some feedback, I’ve added a free option: a slimmed-down, standard version of the same database that isn’t as extensive. It’s actually the same database that costs $40-$50 on other sites, like zip-codes.com.

Free option:

https://www.everythingbyzipcode.com/product/free-zip-code-database-lookup-file

It’s free and up to date! Enjoy!

If you have any recommendations, feel free to DM me on Twitter @bresslertweets.

David

/r/datasets
https://redd.it/uzo90t
54 views, 02:14
2022-05-31 04:14:53
How to Tell a Raven from a Crow

/r/Infographics
https://redd.it/v0m7rm
53 views, 01:14
2022-05-31 02:14:56
Migration of doctors from source (orange) countries to destination (green) countries. Saluja, Rudolfson, Massenburg, Meara, Shrime/BMJ Global Health,

/r/MapPorn
https://redd.it/v0zbc2
58 views, 23:14
2022-05-31 01:15:01 I don't get why there are so many shady location data providers when Google Popular Times and OpenStreetMap are easy to access and let you derive similar conclusions.

Location data providers are often in the press with negative headlines. These services collect movement data from apps and aggregate it to derive movement patterns that might be helpful for marketers. In fact, I have twice evaluated a PoC with such location data brokers.

1. They were all shady about where the data comes from, which is important for understanding the bias in the data. I never got a good answer.
2. The data often represented less than 0.4% of the population (at least in Europe; it's a different game in the USA). For a big city they might have 20K unique users, while more than 3M people were living in the city.
3. They dismiss any professional data analytics principle. The data comes as CSV (if there is a lot of data, they give you something like 10 separate files), and it was not always internally plausible.

Those experiences led me to rebuild parts of what those data brokers offer, but using only open-source data:

1. If it is about location data, you should know OpenStreetMap. It's the biggest database with meta information on locations. It's not perfect, but big companies like Mapbox, Apple, and Microsoft rely on it. Since the API is kind of messy, you can use this repository to load whole cities' information smoothly into Postgres --> https://github.com/kuwala-io/kuwala/blob/master/kuwala/pipelines/osm-poi/README.md

2. Google Popular Times: Movement data can also be found on Google. When you search for a location, it often shows how frequently the place is visited (on an index of 0-100). With this library you can access the Popular Times data for single locations and entire cities --> https://github.com/kuwala-io/kuwala/blob/master/kuwala/pipelines/google-poi/README.md

3. Global Admin Boundaries: A big problem people often run into when working with location data is aggregating it into different geographic slices (country level, admin level, or even smaller, down to sub-districts). Here is a repo that cleaned the OpenStreetMap data for geo boundaries worldwide, from very broad to very fine granularity --> https://github.com/kuwala-io/kuwala/blob/master/kuwala/pipelines/admin-boundaries/README.md

I think with those open-source tools and some data science magic you can generate outcomes similar to those of the location data providers, but totally anonymized and free. It would be awesome if anybody is interested in building a case around it :-)

/r/datasets
https://redd.it/v1192a
56 views, 22:15
2022-05-31 00:14:42
[OC] Total Cumulative Alcohol Volume Consumed During a Day of Drinking with Friends

/r/dataisbeautiful
https://redd.it/v0yq2d
61 views, 21:14
2022-05-30 23:15:00 Dataset for global temperature & precipitation projection levels. CSV if possible.

The Climate Change Knowledge Portal has an option for downloading projections from 2005 to 2100, but the download only displays the historical periods. https://climateknowledgeportal.worldbank.org/download-data Does this happen for anyone else? I am trying to download the aggregated annual time series data at the national + subnational levels.

Are there any other dataset options for downloading temp. & precip. projections in CSV?

/r/datasets
https://redd.it/v0k8w5
66 views, 20:15
2022-05-30 22:14:59 [D] What do you value in a paper replication?

Context: Recently read a paper from a few years back that I thought was pretty cool. Ended up replicating the implementation on GitHub, because (1) I believe the idea should be made more accessible, and (2) as good old-fashioned practice. Throughout the time spent working on it, replicating training results was dead last in priority, and I nearly forgot about it before considering the exercise complete.

Thus my curiosity: r/MachineLearning, what do you value in a paper replication?

P.S.: Might as well link the repo while I'm here. Happy to hear any feedback!

/r/MachineLearning
https://redd.it/v100ix
63 views, 19:14
2022-05-30 21:14:58
Will you be offered food when you are a guest

/r/MapPorn
https://redd.it/v0xi5l
60 views, 18:14