🔥 Burn Fat Fast. Discover How! 💪

Big Data Science

Logo of telegram channel bdscience — Big Data Science B
Logo of telegram channel bdscience — Big Data Science
Channel address: @bdscience
Categories: Technologies
Language: English
Subscribers: 1.44K
Description from channel

Big Data Science channel gathers together all interesting facts about Data Science.
For cooperation: a.chernobrovov@gmail.com
💼 — https://t.me/bds_job — channel about Data Science jobs and career
💻 — https://t.me/bdscience_ru — Big Data Science [RU]

Ratings & Reviews

1.67

3 reviews

Reviews can be left only by registered users. All reviews are moderated by admins.

5 stars

0

4 stars

0

3 stars

1

2 stars

0

1 stars

2


The latest Messages 21

2021-06-25 06:10:07 Welcome to office with a smile!
Canon's biometric access control systems passes into Chinese offices and other work areas only those employees who smile: a smile identification function is built into the face recognition module in video cameras at the entrance. This is expected to enhance corporate spirit and employee loyalty.)
https://www.theverge.com/2021/6/17/22538160/ai-camera-smile-recognition-office-workers-china-canon
218 views03:10
Open / Comment
2021-06-23 06:33:20 Mathematics for the Data Scientist, Part 2: Zipf's Law
This empirical pattern of natural language word frequency distribution is often used in quantitative linguistics and NLP problems. Zipf's law says: if all words in a large text are ordered in descending order of frequency of their use, then the frequency of the n-th word in this list will be inversely proportional to its ordinal number n (rank). For example, the second most commonly used word occurs about two times less often than the first, the third - three times less often than the first, etc.
The pattern was first discovered by French stenographer Jean-Baptiste Estoux in 1908. In practice, the law was applied to describe the distribution of city sizes by the German physicist Felix Auerbach in 1913. And the American linguist George Zipf actively popularized this pattern in 1949, proposing to use it to describe the distribution of economic forces and social status: the richest person has twice as much money as the next rich man, etc. An explanation of Zipf's law based on the correlation properties of additive Markov chains (with a step memory function) was given in 2005. Mathematically, Zipf's law is described by the Pareto distribution (the well-known 80 to 20 principle).
The different areas of application of the law (not only linguistics) are explained by the American bioinformatics specialist Wentian Li, who proved that a random sequence of characters also obeys this Zipf's law. Scientist argues that Zipf's law is a statistical phenomenon that has nothing to do with the semantics of a text, and the probability of a random occurrence of any word of length n in a chain of random characters decreases with increasing n in the same proportion as the rank of this word in the frequency list (ordinal scale). Therefore, the multiplication of the rank of a word by its frequency is a constant.
147 views03:33
Open / Comment
2021-06-21 05:25:21 Communication with photo and video bots: a study of how people reflect the emotions of virtual interlocutors, trusting their appearance and emotions. The conclusions of the scientists will surprise you: mirroring is not at all an indicator of a pleasant conversation, but an indicator of the difficulty in understanding an opponent.
https://techxplore.com/news/2021-06-features-virtual-agents-affect-humans.html
253 views02:25
Open / Comment
2021-06-18 06:36:27 ML-leader from Sber in IT World Awards 2021
The cloud platform ML Space from SberCloud (Sber) was recognized as the best Data Science and AI product in the world according to the organizers of the IT World Awards 2021, the Globee Awards. ML Space is a powerful MLOps data tool supporting all processes in the ML-model development cycle, including testing and deployment. The platform integrates all the necessary frameworks and libraries that allow you to speed up, optimize and simplify the processes of creating ML products.
https://globeeawards.com/it-world-awards/winners/
224 views03:36
Open / Comment
2021-06-16 05:49:25 How to create your own Deep Fake without the long training of neural networks? It's easy!
Try 4 free services:
- https://reface.app/
- https://avatarify.ai/
- https://www.wombo.ai/
- https://www.myheritage.com/deep-nostalgia?lang=RU
You need only upload a photo or make a selfie from your mobile phone to get a believable video of another person's face. For example, MyHeritage, a genealogical website, allows reanimate died people by generating mini-videos of them watching and smiling. However, remember that generating fakes to compromise someone can be considered libel and punishable by law!
163 views02:49
Open / Comment
2021-06-14 05:07:48 Improve image quality using neural networks? No problem! ML-technology DeepHD from Yandex: GAN-networks make images better, removing interference and noise from them. https://yandex.ru/company/technologies/deephd/
194 viewsedited  02:07
Open / Comment
2021-06-11 08:09:42 Smart clothes is a new fashion in the coming years: digital ML fabric from MIT researchers with memory, temperature sensors and a trained neural network
MIT has created the first digitally capable fiber that can detect, store, analyze, and measure physical activity after being sewn into a shirt. Digital fibers enhance the ability of tissues to detect hidden structures in the human body, which can be used to monitor physical performance, medical reports and early detection of diseases, as well as retain impressions. For example, memorize wedding music in the dress you were wearing that day.
Until now, electronic fibers have been analog, carrying a continuous electrical signal, not digital. This is the first implementation of a structure with the ability to digitally store and process data, allowing a new dimension of content to be added to textiles to program fabrics.
The new fiber was created by placing hundreds of square silicon digital microchips in a preform, which was then used to create a polymer fiber. By precisely controlling the flow of the polymer, the researchers were able to create a fiber with a continuous electrical connection between the chips for tens of meters.
The fiber itself is thin and flexible, it can be passed through a needle, sewn into fabric and washed at least 10 times without breaking, and it is also not felt at all. Thanks to the digital addressing method, it is possible to include the functionality of one element without affecting the rest of the elements. Digital fiber can also store a large amount of information in memory. The researchers were able to record, store and read information about the fiber, including a 767-kilobyte full-color short video file and a 0.48-megabyte music file. Files can be stored for two months without power.
The fiber includes a neural network of 1,650 connections in tissue memory, which can be trained on data in real time directly on a person, analyzing information about body temperature taking into account physical activity. Thanks to this, in the future, clothing will be able to detect and warn people in real time about changes in health indicators (respiratory and heart rate) and transmit data about muscle activation to athletes during training. Now the smart fabric is controlled by a small external device, and in the future it is planned to develop a new chip as a microcontroller connected to the fiber itself.
https://news.mit.edu/2021/programmable-fiber-0603
213 views05:09
Open / Comment
2021-06-09 06:23:48 Mathematics for the Data Scientist, Part 1: Benford's Law
Benford's (Newcomb-Benford) law of the first digit describes the probability of occurrence of a certain first significant digit in distributions of quantities taken from real life. This mathematical law is true for many distributions, it allows you to predict the frequency of occurrence of the second and third digits in the dataset.
For the first time this law was discovered by the American astronomer Simon Newcome in 1881, analyzing the degree of wear and tear of book pages. And in 1938, physicist Frank Benford made similar conclusions based on the results of the analysis of tables on the characteristics of rivers, chemical compounds and house numbers in the city directory. An analysis of numbers showed that one is the first significant digit with a probability of not 1/9, as it seems at first glance, but about 1/3.
Benford's Law applies to sets of numbers that can grow exponentially, i.e. the rate of growth of a value is proportional to its current value, for example, stock balances in warehouses, stock prices, population size, length of rivers, area of countries.A set of numbers satisfies Benford's law if the first digit d (𝑑∈1,…, 9) occurs in the equation. Using this distribution, you can predict which digit occurs most frequently in the dataset. The law usually does not apply for distributions with given minimum or maximum values, as well as those that cover only one or two orders of magnitude. Also Benford's law does not apply to texts. The sample size for the law of the first digit should be sufficient to apply statistical methods. In practice, the first digit law is applied in applications for detecting fraud in tax forms, election results, economic performance and accounting data.
221 views03:23
Open / Comment
2021-06-07 06:39:28 AI in retail: 7 examples
eBay
- Pricing and stockpiling, optimizing the appearance of product cards to increase appeal and increase sales
Sephora uses a color matching recommendation system for color cosmetics (lipstick, eyeshadow and powder): the camera scans the skin color, analyzes the data and generates a unique color number and selects the product from the product line that suits this client best.
Tesco - inventory management: forecasting and replenishing with weather and regional characteristics, as well as data from CCTV cameras directed to store shelves. And routing ML-algorithms help buy faster in Tesco Online.
OTTO - Predicting future purchases based on the analysis of 3 billion historical transactions and 200 additional variables (weather, website searches, etc.). The accuracy of the forecast of which product will be sold within a month reaches 90%. This helps to optimize warehouse stocks and increase product turnover.
Simbe Robotics creates robots that detect violation of the plan for the placement of goods, their lack of goods and non-compliance with price tags using computer vision systems. She not only recognizes products, but also recommends how to fill them.
Vekia has developed a supply chain management solution for Leroy Merlen, Etam, Okaidi and Jacadi: control of goods in each store with daily assortment assessments. The system calculates the optimal stock level for each location several times a day and can automatically generate an order for the required items.
Diwo - determination of the factors of decrease in sales for individual products. The ML system also offers a set of recommended strategies for improving the situation, suggesting the ideal time to start promotions and other attributes of advertising campaigns.
185 views03:39
Open / Comment
2021-06-04 07:06:49 Looking for something interesting? Welcome to open and free webinars from MIT professors:

1) Banach Space Representer Theorems for Neural Networks - Prof. Robert D. Nowak
https://www.csail.mit.edu/event/cbmm-special-seminar-banach-space-representer-theorems-neural-networks
June 8, 14:00 EST
https://mit.zoom.us/j/97306008379?pwd=OVR2MU1uNXcrcU5DZkRncmlnZndMZz09
Passcode: 289045

2) Next-generation recurrent network models for cognitive neuroscience - Guangyu Robert Yang
https://www.csail.mit.edu/event/cbmm-special-seminar-next-generation-recurrent-network-models-cognitive-neuroscience
June 15, 14:00 EST
https://mit.zoom.us/j/94734403753?pwd=YW5udzZJdndqVnc1NnkyQ0s3L0hVUT09
Passcode: 080128
245 views04:06
Open / Comment