Connect with us

Artificial Intelligence

Financial data from users’ tax filing websites has been sent to Facebook





Services including TaxAct, TaxSlayer, and H&R Block were discovered transferring confidential data by the Markup.
The Markup has found that major tax preparation companies including H&R Block, TaxAct, and TaxSlayer have been covertly sending private financial data to Facebook when Americans file their taxes online.

Data on names and email addresses as well as frequently even more specific information, such as information on users’ income, filing status, refund amounts, and dependents’ college scholarship amounts, are provided through a commonly used code known as the Meta Pixel.
Whether or whether the person utilizing the tax filing service has an account on Facebook or other platforms run by its owner Meta, the information transmitted to the firm can be utilized to power its advertising algorithms.

The Internal Revenue Service processes around 150 million computerized individual tax returns each year, and The Markup discovered that the pixel is used by some of the most popular e-filing firms.

For instance, users of the well-known service TaxAct are required to give personal information in order to calculate their returns, such as their income and investment information. According to an examination by The Markup, a pixel on TaxAct’s website then relayed some of that information to Facebook, including users’ filing status, their AGI, and the size of their return. Refunds were rounded up to the nearest hundred and income to the nearest thousand. Additionally, the pixel transmitted dependents’ names in an obscured but typically reversible manner.

The Markup discovered comparable financial data—but not names—being provided to Google through its service by TaxAct, which claims to have about 3 million “consumer and professional users” on its website.
The Meta Pixel was used by other tax filing services besides TaxAct. The world’s largest provider of tax preparation services, H&R Block, which additionally provides an online filing option that draws millions of customers each year, integrated a pixel on its website that collected data on users’ use of health savings accounts and the grants and costs associated with dependents’ college tuition.

As part of Facebook’s “advanced matching” system, which collects information on website visitors in an effort to connect them to Facebook accounts, TaxSlayer, a different popular filing service, submitted personal information to the social media corporation. Phone numbers, the name of the user filling out the form, and the names of any dependents added to the return were among the data collected by the pixel on TaxSlayer’s website. Similar to TaxAct, Facebook was able to link a user to an existing profile despite the fact that precise demographic information about the person was obscured. According to TaxSlayer, 10 million federal and state tax returns were completed last year.

The Markup also discovered the pixel code on a tax preparation website run by Ramsey Solutions, a firm that provides software and financial planning services and makes use of a TaxSlayer service. From a tax return summary page, that pixel collected even more sensitive data, such as details on income and refund amounts. This information was only supplied after users clicked drop-down headings to view more of their report’s details on the website.
The pixel was used by even Intuit, the business that handles America’s leading online file system. However, Intuit’s TurboTax only sent usernames and the most recent sign-in time to Meta instead of financial information. The company completely removed the pixel from all sites after sign-in.

The protection of our customers’ data is something we take very seriously, according to Nicole Coburn, a TaxAct spokesman, in an email. “TaxAct always attempts to abide by all IRS laws.” The business “frequently evaluate[s] our processes as part of our continuous commitment to privacy, and will assess the information,” according to H&R Block spokesperson Angela Davied.

In an email, Ramsey Solutions spokesperson Megan McConnell stated that the business “installed the Meta Pixel to give a more tailored customer experience.”

The statement read, “We did NOT know and were never told that Facebook was collecting personal tax information through the Pixel.” We quickly notified TaxSlayer to deactivate the Pixel from Ramsey SmartTax as soon as we learned of it.

TaxSlayer removed the pixel to assess its use, according to spokesman Molly Richardson, who responded to The Markup’s email. She said that Ramsey Solutions “decided to remove the pixel” as well, stressing that “our customers’ privacy is of the utmost importance” and that “we take concerns regarding our customers’ information extremely seriously.”

While Intuit “may share some non-tax-return information, such as username, with marketing partners to deliver a better customer experience,” like not showing Intuit ads on Facebook to people who have accounts already, the company’s pixel “does not track, gather, or share information that users enter in TurboTax while filing their taxes.” The business claimed to be in accordance with laws but changed the pixel so that usernames are no longer sent.
The Markup’s results, according to Mandi Matlock, a tax law lecturer at Harvard Law School, reveal that taxpayers are “giving some of the most sensitive information that they own, and it’s being exploited.”

This is horrible, she remarked. “It is, really.”

After The Markup approached TaxAct for comment, the company’s website stopped sending financial information to Meta on Monday, but it still received dependents’ identities. The website kept sending Google Analytics money-related data. Additionally, as of Monday, TurboTax ceased sending usernames through the pixel at sign-in, and TaxSlayer and Ramsey Solutions disabled the pixel from their tax filing websites. The website for H&R Block was still disseminating information on college tuition assistance and health savings accounts.

How Meta Pixel monitors users
Anyone who wants the pixel code can get it for free from Meta, which gives companies the freedom to use it wherever they want on their websites.

The businesses and Facebook both benefit from using the code. When a customer visits a company’s website, the pixel may keep track of the things they browsed, like a T-shirt, for instance. The company can locate an audience that could already be interested in its items by targeting its Facebook advertisements to people who looked at that shirt.
Meta also benefits financially. The business claims it can use the information it gathers from devices like the pixel to power its algorithms, giving it knowledge of people’ online behaviors.

Facebook has seen success with this tactic. The business informed Congress in 2018 that there were over 2 million pixels on the web, a significant data collection effort that most internet users never saw.

The technique is widespread, according to Jon Callas, director of public interest technology at the Electronic Frontier Foundation, who described his reaction to The Markup’s findings as “shock but not surprise.”

The Markup’s analysis of sensitive data collection shows that some of it is related to the Meta Pixel’s default behaviors, while other instances appear to be the result of customizations made by tax filing services, people working on their behalf, or other software that has been installed on the website.

For instance, the normal setup of the Meta Pixel automatically collects the title of a page the user is seeing, along with the web address of the page and other data. This is how Meta Pixel gathered health savings account and college spending information from the H&R Block website. It was able to obtain salary data from Ramsey Solutions because it was presented as a summary that could be enlarged by clicking. The pixel identified the summary as being a button, because by default, the pixel captures text from a clicked button.
Automatic advanced matching was a feature used in the TaxSlayer and TaxAct pixels. This function examines forms for areas where it suspects there may be personally identifiable data, such as a phone number, first name, last name, or email address, and then transmits any such data it finds to Meta. This function on TaxSlayer’s website gathered contact information and the names of taxpayers and their dependents. It gathered dependents’ names on TaxAct.

According to Meta, the hashing method used to encrypt the data supplied by the matching feature is done so in order to “help preserve user privacy.” The pre-obfuscated version of the data may, however, usually be found by the corporation. In fact, Meta specifically used the hashed data to connect additional pixel data to Facebook and Instagram identities.

When The Markup created a test pixel linked to a business account, this pixel functionality was disabled by default but could be enabled by selecting a toggle during setup.

A “custom event,” which is sent only if the pixel is specified outside of the default by a website operator or another application the website operator adds to their site, is what TaxAct used to send dollar figures like adjusted gross income to Meta. Inquiries about whether and why TaxAct configured the pixel in this way went unanswered.

There are restrictions on the kinds of data that Meta claims the pixel will allow it to gather. The corporation claims that it uses automatic filtering to block potentially sensitive data and does not want sensitive information, including financial data, delivered to it. According to its help center, providing information such as bank account or credit card details or “knowledge regarding an individual’s financial account or status” is prohibited.

Still, The Markup discovered that two tax sites supplied Facebook one specific form of banned data – income. TaxAct may have also been transmitting a parameter with the name “student loan interest” before the pixel started filtering it before it was delivered, according to data it supplied to Facebook.
The Markup monitored websites’ pixel usage from January to July of this year as part of the Pixel Hunt, a collaboration with Mozilla Rally. Participants in the initiative installed a browser extension that gave The Markup a copy of all the information given with Meta via the pixel.

Through data given by Pixel Hunt participants, The Markup first learned that tax preparers were disclosing sensitive information. The Markup subsequently created accounts on the businesses’ websites and used the “Network” portion of Chrome DevTools, a feature included with Google’s Chrome browser, to reproduce and validate the data.

The Markup discovered sensitive data transferred to Facebook earlier this year with the aid of Pixel Hunt participants on the Education Department’s federal student aid application website, crisis pregnancy websites, and the websites of prominent hospitals.

Because Meta gathers so much information, occasionally even the firm doesn’t know where it goes. In a leaked memo from Facebook’s privacy engineers earlier this year, Vice reported that the firm couldn’t guarantee it wouldn’t use specific data for specific objectives because it “does not have an acceptable level of control and explainability over how our systems use data.”
Facebook has “extensive systems and controls to handle data and comply with privacy standards,” a corporate spokeswoman claimed at the time, according to Vice.

Dale Hogan, a representative for Meta, referred to the organization’s policies on sensitive financial information in answer to The Markup’s inquiries over the use of the pixel by the tax websites.

Hogan stated in an email that advertisers “should not transmit sensitive information about people through our Business Tools.” “Doing so is against our regulations, and we train advertisers on how to set up Business tools correctly to avoid this,” the statement reads. Our technology is built to weed out any potentially sensitive information it can find.

An email from a Google representative, Jackie Berté, stated that the company “has strict policies against advertising to people based on sensitive information” and that Google Analytics data is “obfuscated, meaning it is not tied back to an individual.” Additionally, she added, “our policies prohibit customers from sending us data that could be used to identify a user.”

Tax data is strictly regulated by the IRS.
Between 2001 and 2019, Nina Olson, the executive director of the nonprofit Center for Taxpayer Rights, served as the Internal Revenue Service’s national taxpayer advocate, a position in the organization designed to represent the interests of taxpayers.

She helped draft the rules governing the disclosure of tax information as part of her responsibilities at the IRS. Olson stated that the IRS standards governing the use of data by private tax filing firms are “extremely stringent” on purpose.

According to the rules she helped create, tax preparers, including e-filing companies, are only permitted to use the information that taxpayers provide for certain limited purposes; anything beyond simply facilitating filing requires the user’s signed consent that specifies the recipient and the specific information being disclosed.

Even the font size of requests for disclosure is regulated by the government, which states that it must be “the same size as, or larger than, the typical or standard body text used by the website or software program.”

While Olson said she was not aware of any criminal cases that had been pursued, the penalty for sharing data without consent could be severe: fines and even jail time are possible.

The Markup searched the websites of tax preparation services for disclosures that expressly named Facebook or Meta, but it was unable to locate any. Some businesses, however, incorporated rather extensive disclosure agreements.

For instance, TaxAct asked customers to consent to the sibling firm, TaxSmart Research LLC, receiving their tax information so that it may “create, promote, and provide goods and services” for users. TaxSmart Research LLC may work with service providers and business partners to complete these responsibilities, it was further stated. In contrast, H&R Block included almost the same disclosure request so that “H&R Block Personalized Services, LLC” could offer its own products. Although users had the choice to opt out of sharing tax information with Facebook on certain sites, The Markup’s tests revealed that data was shared with Facebook regardless of the users’ choices.

According to Olson, any disclosure by a tax preparer must specify the precise objective and recipient in order to be in compliance. Do they have a list stating that they will reveal the return amounts, your children, and whatever else on Facebook? she questioned. If not, they might be breaking the law. Regarding whether any of the websites that shared tax information were in violation of the law, the IRS declined to comment or respond to any inquiries.

There is no escape for taxpayers
There aren’t many options available to American taxpayers outside using private businesses to file their taxes.

In contrast to other nations, the United States has a substantially privatized tax filing system that frequently necessitates the employment of outside tax preparers. In other nations, the taxpayers simply give their approval to the estimates that the government does. However, as a result of a successful lobbying campaign by private businesses, tax preparers in the US now serve as the official go-between for taxpayers and the government.

Today, tax preparation is a significant sector in the United States, worth more than $11 billion, according to market research.

Although there is a free preparation and filing alternative, it is only available to those making $73,000 or less and might be challenging to utilize. Companies are accused for not making the option easily accessible even when they provide their tax software at no cost as part of an agreement with the IRS.

The Markup discovered using the pixel that the IRS even successfully guides taxpayers attempting to file for free to some of the businesses. The Free File Alliance, an arrangement including a few tax preparation firms, includes TaxAct and TaxSlayer. H&R Block and TurboTax have previously participated in the program.

Harvard’s Matlock claimed that The Markup’s findings demonstrated the nearly unavoidable implications of entrusting a government requirement to for-profit businesses. According to her, the procedure leaves users with no alternative but to give their data to Facebook in order to comply with the law.

It’s aggravating, she added, since taxpayers are being forced into the hands of these private, for-profit businesses in order to fulfill their tax filing duties. “Really, we don’t have a choice in the issue.”

As Editor here at GeekReply, I'm a big fan of all things Geeky. Most of my contributions to the site are technology related, but I'm also a big fan of video games. My genres of choice include RPGs, MMOs, Grand Strategy, and Simulation. If I'm not chasing after the latest gear on my MMO of choice, I'm here at GeekReply reporting on the latest in Geek culture.

Artificial Intelligence

Gaming models are created by Auctoria using generative AI





Aleksander Caban, co-founder of Polish VR game developer Carbon Studio, noticed a major problem in modern game design several years ago. He manually created rocks, hills, paths, and other video game environment elements, which was time-consuming and laborious.

Caban created tech to automate the process.

In collaboration with Michal Bugała, Joanna Zając, Karolina Koszuta, and Błażej Szaflik, he founded Auctoria, an AI-powered platform for creating 3D game assets. Auctoria, from Gliwice, Poland, is in Startup Battlefield 200 at Disrupt 2023.

Auctoria was founded on a passion for limitless creativity, according to Zając in an email interview. It was designed to help game developers, but anyone can use it. Few advanced tools exist for professionals; most are for hobbyists and amateurs. We want to change that.”

Using generative AI, Auctoria creates various video game models. One feature generates basic 3D game levels with pathways, while another converts uploaded images and textures of walls, floors, and columns into 3D versions.

Like DALL-E 2 and Midjourney, Auctoria can generate assets from text prompts. Or they can submit a sketch, which the platform will try to turn into a digital model.


All AI algorithms and training data for Auctoria were developed in-house, according to Zając.

She said “Auctoria is based 100% on our content, so we’re not dependent on any other provider.” It’s independent—Auctoria doesn’t use open source or external engines.

In the emerging market for AI game asset generation tools, Auctoria isn’t alone. The 3DFY, Scenario, Kaedim, Mirage, and Hypothetic startups create 3D models. Even Nvidia and Autodesk are entering the space with apps like Get3D, which converts images to 3D models, and ClipForge, which generates models from text descriptions.

Meta also tried tech to create 3D assets from prompts. In December, OpenAI released Point-E, an AI that synthesizes 3D models for 3D printing, game design, and animation.

Given the size of the opportunity, the race to market new solutions isn’t surprising. According to Proficient Market Insights, 3D models could be worth $3.57 billion by 2028.

According to Zając, Auctoria’s two-year R&D cycle has led to a more robust and comprehensive toolset than rivals.

“Currently, AI-based software is lacking for creating complete 3D world models,” Zając stated. “3D editors and plugins offer only a fraction of Auctoria’s capabilities. Our team started developing the tool two years ago, giving us a ready-to-use product.”

Auctoria, like all generative AI startups, must deal with AI-generated media legal issues. Not yet clear how AI-generated works can be copyrighted in the U.S.

However, the Auctoria team of seven employees and five co-founders is delaying answering those questions. Instead, they’re piloting the tooling with game development studios like Caban’s Carbon Studio.

Before releasing Auctoria in the coming months, the company hopes to raise $5 million to “speed up the process” of creating back-end cloud services to scale the platform.

Zając stated that the funding would reduce the computing time required for creating worlds or 3D models with Auctoria. Achieving a software-as-a-service model requires both infrastructure and user experience enhancements, such as a simple UI, excellent customer service, and effective marketing. We’ll keep our core team small, but we’ll hire more by year’s end.”

Continue Reading

Artificial Intelligence

DALL-E 3, from OpenAI, lets artists skip training





Today, OpenAI released an updated version of DALL-E, its text-to-image tool that uses ChatGPT, its viral AI chatbot, to make prompting easier.

Most modern, AI-powered image generation tools turn prompts—image descriptions—into photorealistic or fantastical artwork. However, writing the right prompt is so difficult that “prompt engineering” is becoming a profession.

New OpenAI tool DALL-E 3 uses ChatGPT to fill prompts. OpenAI’s premium ChatGPT plans, ChatGPT Plus and ChatGPT Enterprise, allow users to type in an image request and refine it with the chatbot, receiving the results in the chat app.

ChatGPT can make a few-word prompt more descriptive, guiding the DALL-E 3 model.

DALL-E 3 adds more than ChatGPT integration. OpenAI claims that DALL-E 3 produces better images that better reflect prompts, especially for longer prompts. It handles text and human hands better, which have previously hampered image-generating models.


OpenAI claims DALL-E 3 has new algorithmic bias-reduction and safety mechanisms. For instance, DALL-E 3 will reject requests to depict living artists or public figures. Artists can now choose not to train future OpenAI text-to-image models with their work. (OpenAI and its rivals are being sued for using copyrighted artists’ work to train their generative AI image models.)

As the image-synthesizing generative AI race heats up, DALL-E 3 launches. Midjourney and Stability AI keep improving their image-generating models, putting pressure on OpenAI to keep up.

OpenAI will release DALL-E 3 to premium ChatGPT users in October, then research labs and API customers. The company did not say when or if it would release a free web tool like DALL-E 2 and the original model.

Continue Reading

Artificial Intelligence

Open-source Microsoft Novel protein-generating AI EvoDiff





All diseases are based on proteins, natural molecules that perform vital cellular functions. Characterizing proteins can reveal disease mechanisms and ways to slow or reverse them, while creating proteins can lead to new drug classes.

The lab’s protein design process is computationally and human resource-intensive. It involves creating a protein structure that could perform a specific function in the body and then finding a protein sequence that could “fold” into that structure. To function, proteins must fold correctly into three-dimensional shapes.

Not everything has to be complicated.

Microsoft introduced EvoDiff, a general-purpose framework that generates “high-fidelity,” “diverse” proteins from protein sequences, this week. Unlike other protein-generating frameworks, EvoDiff doesn’t need target protein structure, eliminating the most laborious step.

Microsoft senior researcher Kevin Yang says EvoDiff, which is open source, could be used to create enzymes for new therapeutics, drug delivery, and industrial chemical reactions.

Yang, one of EvoDiff’s co-creators, told n an email interview that the platform will advance protein engineering beyond structure-function to sequence-first design. EvoDiff shows that ‘protein sequence is all you need’ to controllably design new proteins.

A 640-million-parameter model trained on data from all protein species and functional classes underpins EvoDiff. “Parameters” are the parts of an AI model learned from training data that define its skill at a problem, in this case protein generation. The model was trained using OpenFold sequence alignment data and UniRef50, a subset of UniProt, the UniProt consortium’s protein sequence and functional information database.

Modern image-generating models like Stable Diffusion and DALL-E 2 are diffusion models like EvoDiff. EvoDiff slowly subtracts noise from a protein made almost entirely of noise to move it closer to a protein sequence.


Beyond image generation, diffusion models are being used to design novel proteins like EvoDiff, create music, and synthesize speech.

“If there’s one thing to take away [from EvoDiff], I think it’s this idea that we can — and should — do protein generation over sequence because of the generality, scale, and modularity we can achieve,” Microsoft senior researcher Ava Amini, another co-contributor, said via email. “Our diffusion framework lets us do that and control how we design these proteins to meet functional goals.”

EvoDiff can create new proteins and fill protein design “gaps,” as Amini noted. A protein amino acid sequence that meets criteria can be generated by the model from a part that binds to another protein.

EvoDiff can synthesize “disordered proteins” that don’t fold into a three-dimensional structure because it designs proteins in “sequence space” rather than structure. Disordered proteins enhance or decrease protein activity in biology and disease, like normal proteins.

EvoDiff research isn’t peer-reviewed yet. Microsoft data scientist Sarah Alamdari says the framework needs “a lot more scaling work” before it can be used commercially.

“This is just a 640-million-parameter model, and we may see improved generation quality if we scale up to billions,” Alamdari emailed. WeAI emonstrated some coarse-grained strategies, but to achieve even finer control, we would want to condition EvoDiff on text, chemical information, or other ways to specify the desired function.”

Next, the EvoDiff team will test the model’s lab-generated proteins for viability. Those who are will start work on the next framework.

Continue Reading