What is Google’s new policy for AI training?

Google updates its Privacy Policy  to use Public Data for AI Training

In a recent policy update, tech giant Google decided to collect data from all available sources on the internet to train its AI models, including Bard.

Under the new policy, Google will be able to collect data from a variety of public sources, including social media posts, government records, and websites. This data will be used to train AI models for a variety of purposes, such as spam filtering, fraud detection, and language translation.

Google argues that using public data is necessary to train AI models accurately and effectively. The company also said it will take steps to protect user privacy, such as removing identifying data before using it to train models.

Google’s Policy page states: “We may share non-personally identifiable information publicly and with our partners — such as publishers, advertisers, developers or rights owners. For example, we share information publicly to show trends in overall usage of our services.”

What is Google’s new policy?

Google’s policy on public data collection is not very transparent, so users must read the policy carefully to understand what information Google collects.

Here’s what the policy update refers to, “Google uses information to improve our services and develop new products, features and technologies that benefit users and the public.

“For example, we may collect information publicly available online or from other public sources to help train Google AI models and build products and features like Google Translate capabilities , Bard and Cloud AI. Or, if your business information appears on a website, we may index and display it on Google services. it added.

Previously, the company used this information to update and train language models to improve existing products like Google Translate. Now, the company has clearly mentioned that all public data will be used to update its AI products.

Jagranjosh

Source: Google

The image above is taken from Google’s policy repository, where the green color represents newly added information.

The dangers of data scraping

This new policy update may cause serious data collection and privacy concerns. While companies typically keep user data confidential for future use and new product development, Google’s new policy allows the company to use any publicly available information to train models. My AI.

This means that Google can access and process any type of data available on the internet, including personal information. The company mentioned that it did not identify the sources but it could still cause trouble.

First, it may violate an individual’s right to privacy. When data is collected without permission, individuals may not be aware that their data is being collected or how it is being used. This can lead to a number of problems, such as identity theft and financial fraud.

Second, data scraping can be used to create biased AI models. If AI models are trained on data pulled from the Internet, the models may reflect biases that are already present in the data. This can lead to AI models that discriminate against certain groups of people.

Finally, data collection can disrupt the Internet. The most recent example of this is the Twitter outage. When data is pulled from websites, it can slow down websites and make them difficult to use.

Elon Musk expressed his concerns about data collection and he decided to limit the number of tweets people can read per day. He is also continuously working to make the platform more secure by monetizing various services.

In short, the new policy can certainly help Google create powerful AI but it will also be a safety hazard. These policies may also lead to increased data collection and privacy violations. It’s important to carefully monitor how Google implements these policies.

Categories: Optical Illusion
Source: pagasa.edu.vn

Leave a Comment