Apple is reportedly working on a method to reduce the wake phrase for its voice assistant to just “Siri,” according to a report from Mark Gurman.
According to a report from Bloomberg’s Mark Gurman, Apple is seeking to replace “Hey Siri” with the word “Siri.” This means that to activate the smart assistant, all you would need to say is “Siri” followed by a command.
According to Gurman, Apple has been working on this feature for a while and is planning to release it either in 2019 or in 2024. To make the feature work effectively, Apple will need to invest in “substantial AI training and underlying engineering effort,” as the smart assistant would need to comprehend the single wake word in a variety of accents and dialects. The two-word trigger phrase “Hey Siri” makes Siri more likely to recognize it.
Since Alexa from Amazon currently allows users to activate the smart assistant by saying “Alexa” rather than “Hey Alexa,” switching to a single wake word could help Siri stay up with Alexa. Additionally, it would put Siri ahead of Google Assistant, which needs to be activated by saying “Ok Google” or “Hey Google” (although you don’t need to repeat the wake word while speaking consecutive requests). Before discontinuing the voice assistant last year, even Microsoft changed the smart speaker command “Hey Cortana” to “Cortana.”
There will be other changes to Siri in addition to the shorter trigger phrase. Gurman claims that Apple might enhance Siri’s comprehension and processing of user queries while also further integrating it into third-party apps and services. The voice assistant recently received some minor updates from Apple, including the addition of a new voice recorded by a member of the LGBTQ+ community and a new Siri activation sound.
Gaming models are created by Auctoria using generative AI
Aleksander Caban, co-founder of Polish VR game developer Carbon Studio, noticed a major problem in modern game design several years ago. He manually created rocks, hills, paths, and other video game environment elements, which was time-consuming and laborious.
Caban created tech to automate the process.
In collaboration with Michal Bugała, Joanna Zając, Karolina Koszuta, and Błażej Szaflik, he founded Auctoria, an AI-powered platform for creating 3D game assets. Auctoria, from Gliwice, Poland, is in Startup Battlefield 200 at Disrupt 2023.
Auctoria was founded on a passion for limitless creativity, according to Zając in an email interview. It was designed to help game developers, but anyone can use it. Few advanced tools exist for professionals; most are for hobbyists and amateurs. We want to change that.”
Using generative AI, Auctoria creates various video game models. One feature generates basic 3D game levels with pathways, while another converts uploaded images and textures of walls, floors, and columns into 3D versions.
Like DALL-E 2 and Midjourney, Auctoria can generate assets from text prompts. Or they can submit a sketch, which the platform will try to turn into a digital model.
All AI algorithms and training data for Auctoria were developed in-house, according to Zając.
She said “Auctoria is based 100% on our content, so we’re not dependent on any other provider.” It’s independent—Auctoria doesn’t use open source or external engines.
In the emerging market for AI game asset generation tools, Auctoria isn’t alone. The 3DFY, Scenario, Kaedim, Mirage, and Hypothetic startups create 3D models. Even Nvidia and Autodesk are entering the space with apps like Get3D, which converts images to 3D models, and ClipForge, which generates models from text descriptions.
Meta also tried tech to create 3D assets from prompts. In December, OpenAI released Point-E, an AI that synthesizes 3D models for 3D printing, game design, and animation.
Given the size of the opportunity, the race to market new solutions isn’t surprising. According to Proficient Market Insights, 3D models could be worth $3.57 billion by 2028.
According to Zając, Auctoria’s two-year R&D cycle has led to a more robust and comprehensive toolset than rivals.
“Currently, AI-based software is lacking for creating complete 3D world models,” Zając stated. “3D editors and plugins offer only a fraction of Auctoria’s capabilities. Our team started developing the tool two years ago, giving us a ready-to-use product.”
Auctoria, like all generative AI startups, must deal with AI-generated media legal issues. Not yet clear how AI-generated works can be copyrighted in the U.S.
However, the Auctoria team of seven employees and five co-founders is delaying answering those questions. Instead, they’re piloting the tooling with game development studios like Caban’s Carbon Studio.
Before releasing Auctoria in the coming months, the company hopes to raise $5 million to “speed up the process” of creating back-end cloud services to scale the platform.
Zając stated that the funding would reduce the computing time required for creating worlds or 3D models with Auctoria. Achieving a software-as-a-service model requires both infrastructure and user experience enhancements, such as a simple UI, excellent customer service, and effective marketing. We’ll keep our core team small, but we’ll hire more by year’s end.”
DALL-E 3, from OpenAI, lets artists skip training
Today, OpenAI released an updated version of DALL-E, its text-to-image tool that uses ChatGPT, its viral AI chatbot, to make prompting easier.
Most modern, AI-powered image generation tools turn prompts—image descriptions—into photorealistic or fantastical artwork. However, writing the right prompt is so difficult that “prompt engineering” is becoming a profession.
New OpenAI tool DALL-E 3 uses ChatGPT to fill prompts. OpenAI’s premium ChatGPT plans, ChatGPT Plus and ChatGPT Enterprise, allow users to type in an image request and refine it with the chatbot, receiving the results in the chat app.
ChatGPT can make a few-word prompt more descriptive, guiding the DALL-E 3 model.
DALL-E 3 adds more than ChatGPT integration. OpenAI claims that DALL-E 3 produces better images that better reflect prompts, especially for longer prompts. It handles text and human hands better, which have previously hampered image-generating models.
OpenAI claims DALL-E 3 has new algorithmic bias-reduction and safety mechanisms. For instance, DALL-E 3 will reject requests to depict living artists or public figures. Artists can now choose not to train future OpenAI text-to-image models with their work. (OpenAI and its rivals are being sued for using copyrighted artists’ work to train their generative AI image models.)
As the image-synthesizing generative AI race heats up, DALL-E 3 launches. Midjourney and Stability AI keep improving their image-generating models, putting pressure on OpenAI to keep up.
OpenAI will release DALL-E 3 to premium ChatGPT users in October, then research labs and API customers. The company did not say when or if it would release a free web tool like DALL-E 2 and the original model.
Open-source Microsoft Novel protein-generating AI EvoDiff
All diseases are based on proteins, natural molecules that perform vital cellular functions. Characterizing proteins can reveal disease mechanisms and ways to slow or reverse them, while creating proteins can lead to new drug classes.
The lab’s protein design process is computationally and human resource-intensive. It involves creating a protein structure that could perform a specific function in the body and then finding a protein sequence that could “fold” into that structure. To function, proteins must fold correctly into three-dimensional shapes.
Not everything has to be complicated.
Microsoft introduced EvoDiff, a general-purpose framework that generates “high-fidelity,” “diverse” proteins from protein sequences, this week. Unlike other protein-generating frameworks, EvoDiff doesn’t need target protein structure, eliminating the most laborious step.
Microsoft senior researcher Kevin Yang says EvoDiff, which is open source, could be used to create enzymes for new therapeutics, drug delivery, and industrial chemical reactions.
Yang, one of EvoDiff’s co-creators, told n an email interview that the platform will advance protein engineering beyond structure-function to sequence-first design. EvoDiff shows that ‘protein sequence is all you need’ to controllably design new proteins.
A 640-million-parameter model trained on data from all protein species and functional classes underpins EvoDiff. “Parameters” are the parts of an AI model learned from training data that define its skill at a problem, in this case protein generation. The model was trained using OpenFold sequence alignment data and UniRef50, a subset of UniProt, the UniProt consortium’s protein sequence and functional information database.
Modern image-generating models like Stable Diffusion and DALL-E 2 are diffusion models like EvoDiff. EvoDiff slowly subtracts noise from a protein made almost entirely of noise to move it closer to a protein sequence.
Beyond image generation, diffusion models are being used to design novel proteins like EvoDiff, create music, and synthesize speech.
“If there’s one thing to take away [from EvoDiff], I think it’s this idea that we can — and should — do protein generation over sequence because of the generality, scale, and modularity we can achieve,” Microsoft senior researcher Ava Amini, another co-contributor, said via email. “Our diffusion framework lets us do that and control how we design these proteins to meet functional goals.”
EvoDiff can create new proteins and fill protein design “gaps,” as Amini noted. A protein amino acid sequence that meets criteria can be generated by the model from a part that binds to another protein.
EvoDiff can synthesize “disordered proteins” that don’t fold into a three-dimensional structure because it designs proteins in “sequence space” rather than structure. Disordered proteins enhance or decrease protein activity in biology and disease, like normal proteins.
EvoDiff research isn’t peer-reviewed yet. Microsoft data scientist Sarah Alamdari says the framework needs “a lot more scaling work” before it can be used commercially.
“This is just a 640-million-parameter model, and we may see improved generation quality if we scale up to billions,” Alamdari emailed. WeAI emonstrated some coarse-grained strategies, but to achieve even finer control, we would want to condition EvoDiff on text, chemical information, or other ways to specify the desired function.”
Next, the EvoDiff team will test the model’s lab-generated proteins for viability. Those who are will start work on the next framework.
- Gadgets8 years ago
Why the Nexus 7 is still a good tablet in 2015
- Mobile Devices8 years ago
Samsung Galaxy Note 4 vs Galaxy Note 5: is there room for improvement?
- Editorials8 years ago
Samsung Galaxy Note 4 – How bad updates prevent people from enjoying their phones
- Mobile Devices8 years ago
Nexus 5 2015 and Android M born to be together
- Gaming8 years ago
New Teaser For Five Nights At Freddy’s 4
- Mobile Devices8 years ago
Google not releasing Android M to Nexus 7
- Gadgets9 years ago
Moto G Android 5.0.2 Lollipop still has a memory leak bug
- Mobile Devices8 years ago
Nexus 7 2015: Huawei and Google changing the game