Guys! Recently, the news that Meta acquired Manus for $2 billion has caused a stir, but it can't be used in China?
Don't worry. Today, I'm going to recommend to you a domestic, free - of - charge local AI agent that doesn't require a VPN - Aipy! With features like open - source local operation, no - code needed, and multi - scenario practicality, it's simply an amazing AI labor - saving tool for ordinary people 🎉
🌟 🌟 Full - fledged Core Advantages
✅ Domestic Access without VPN: Runs locally, no need for a scientific internet access method.
✅ Completely Free: Register and use the invitation code to get 3.5 million Tokens (Invitation code: 4zfb).
✅ Zero - code Operation: Describe your needs in plain language, and the AI will automatically generate/ execute code.
✅ Agent Marketplace: Tools for quantitative research, photo - editing, PPT generation, etc. can be installed with one click.
Aipy integrates the large AI model with the Python program ecosystem. You don't need to know code at all. Just describe your needs in plain language, and it will automatically generate, debug, and execute programs in the background, and finally hand over the complete result to you.
The interface of Aipy is very simple: Enter your needs in the chat box on the left, and the right side will run and display the results in real - time. You just need to say what you want to do, and it will automatically generate and execute code to complete the full loop from instruction to result.
💡 Super - practical in Real - world Tests
Quantitative Research: Free access to historical data of A - shares / US stocks / Hong Kong stocks. Enter the stock name and it will automatically generate a technical analysis report.
Most stock analysis tools on the market require payment. However, Aipy has built - in historical market data of all listed companies in A - shares, US stocks, and Hong Kong stocks, and it can be used for free.
Install "Quantitative Research" in the "Agent Marketplace", click "Go to Use", and just tell it which stock you want to analyze. It will give a comprehensive analysis result from multiple aspects such as technical indicators, valuation levels, and trend status.
It should be emphasized that,the analysis given by the AI is more of a reference and learning tool, and the final investment decision still needs to be made by ourselves.
Batch Photo - editing: Upload a photo folder and let the AI batch - edit photos with one sentence.
First, install "Image Generation" in the "Agent Marketplace", and then click "Go to Use".
Ask Aipy to batch - edit the puppies in the folder into the way I want. It can easily understand natural language without complex prompts.
It completed my task in minutes without me writing any line of code. The generated pictures also have a very good effect.
PPT Generation: One - sentence requirement + Internet search, and a well - structured PPT can be done in minutes.
Aipy is also excellent at PPT generation. Just install "PPT Generation" in the "Agent Marketplace", click "Go to Use", and then state your requirement in one sentence.
For example, if I want it to help me make an introduction to the Xiaomi 17Ultra, as a newly - released product, the AI knowledge base may not have relevant information. We can turn on the Internet search function to let it obtain real - time information.联网搜索的,让它去实时获取。
After a while, a well - structured, complete - content, and clean - layout PPT is generated. From information organization to page presentation, it's done in one go, and the efficiency is remarkable.
Material Download: Throw in the link and it can batch - download website pictures and automatically classify and name them.
Aipy can also handle some more "hands - on" miscellaneous tasks. For example, to batch - download pictures from any website to the local device, just throw the link to it and state your needs.
It will automatically handle the download, classification, and naming, and the obtained files still maintain their original clarity.
If you're not satisfied with the results of some tasks, you can also manually select a more advanced model to execute.Aipy itself has multiple large models built- in and can switch flexibly according to different scenarios.
In addition to the functions mentioned above, the Agent Marketplace of Aipy also integrates many practical tools such as short - video copywriting generation, browser control, contract review, video generation, resume screening, and enterprise information analysis, and related capabilities are still being continuously expanded.
Aipy can not only help us think but also help us work. If you're looking for an AI tool that can accompany you in your work for a long time, Aipy is worth experiencing.
🎁 Exclusive Benefits
Do you want to experience the new - generation super AI assistant Aipy?
Register now and fill in the invitation code 👉RPF2👈 to get 3.5 million Tokens for free!!
The usage method is as follows:
① Visit the Aipy official website: https://www.aipyaipy.com/, and download the latest version of the Aipy client.
② Fill in the above invitation code when registering and logging in.
Guys! Microsoft Copilot is making big news again 🎉 Today, it officially rolls out OpenAI's most powerful model, GPT - 5.2, and it's a free upgrade! This directly ushers in a new era of "expert - level" workflows, pushing office efficiency to the limit.
🌟 Two Models Co - exist, and the Thinking - type is More Powerful
GPT-5.2 and GPT - 5.1 are both available. The Plus version is a "thinking - type" variant - simply put, it's better at in - depth thinking! When dealing with tables, writing review codes, and processing long documents, it's incredibly fast. It can also handle complex tool calls and image analysis.
🚀 Performance Doubles, Crushing Professionals
In 44 professional task tests, GPT - 5.2Thinking was actually 70.9% superior to / on par with industry experts (previously, GPT - 5 was only 38.8%)! Whether it's creating PPTs, scheduling, or producing professional deliverables, it's more reliable than the consultants you hire, taking office automation to a new level.
🔧 A Perfect Score in Rigorous Tests, Mastering Programming and Math
In the programming field: The SWE - Bench Pro test set a new record, far outperforming GPT - 5.1Thinking;
In math competitions: It got a perfect 100% score in AIME2025 and 92.4 points in the GPQA Diamond logic test;
In logic and science: There has been a significant improvement in CharXiv reasoning and ARC - AGI - 2, evolving from a basic assistant to a "digital intelligence entity".
Now it can be used on web pages / Windows / mobile devices. Experience the power of expert - level AI for free! Have you guys tried Copilot's new features? Come and share your office efficiency tools in the comments section below 👇
So as 2025 wraps up, we’ve gone headfirst into a mountain of de-identified data, searching for the quirks, surprises, and secret patterns that shape everyday life with Copilot. We’re finding out just how far it fits into people’s daily rhythms, and how human its uses have become: we often turn to AI for the things that matter most like our health. We analyzed a sample of 37.5 million conversations to find out how people actually use it out in the world. (Note: our system doesn’t just de-identify conversations; it only extracts the summary of the conversation, from which we learn the topic and the intent, and maintains full privacy.)
From health tips that never sleep, to the differences between weekday and weekend usage, to February’s annual “how do I survive Valentine’s Day?” spike, our findings show that Copilot is way more than a tool: it’s a vital companion for life’s big and small moments. And if you’ve ever pondered philosophy at 2 a.m. or needed advice on everything from wellness to winning at life, you’re in good company. So has everybody else.
Our work shows that AI is all about people, a trusted advisor slotting effortlessly into your life and your day. It’s about your health, your work, your play, and your relationships. It meets you where you are. Read all about it in our paper, but here are some of our takeaways.
Health Is Always on Our Minds—Especially on Mobile
No matter the day, month, or time, health-related topics dominate how people use Copilot on their mobile devices. Whether it’s tracking wellness, searching for health tips, or managing daily routines, our users consistently turn to Copilot for support in living healthier lives. This trend held steady throughout the year, showing just how central health is to our everyday digital habits. When it comes to mobile, with its intimacy and immediacy, nothing tops our health.
Most common Topic-Intent pairing conversations, on mobile.
Health is consistently the most common topic while interestingly, language-related chats peak earlier in the year, with entertainment seeing a steady rise.
When Programming and Gaming Cross Paths
August brought a unique twist: programming and gaming topics started to overlap in unexpected ways. Our data showed that users were just as likely to dive into coding projects as they were to explore games—but on the different days of the week! This crossover hints at a vibrant, creative community that loves to code during the week and play during the weekends in equal measure.
August topic ranks for programming and games.
There is a clear change in rank between programming and games through the days of the week, with programming rising from Monday to Friday, and Games shining on the weekends.
February’s Big Moment
February stood out for another reason: Copilot helped users navigate a significant date in their calendar year. Whether it was in preparing for Valentine’s day, or facing the day and the relationships, we saw a spike in activity as people turned to Copilot for guidance, reminders, and support. It’s a great reminder of how digital tools can make life’s important moments a little easier to manage.
Ranking of “Personal Growth and Wellness” and “Relationship” conversations February brings concerns of personal growth before Valentine’s day, with a clear peak of relationship-related conversations on the day.
Late-night Sessions
The larger-than-life questions seem to have a rise during the early hours of the morning, with “Religion and Philosophy” rising through the ranks. Comparatively, travel conversations happen most often during the commuting hours.
Average rank of Travel and Religion and Philosophy conversations per hour of the day.
Whilst people have more travel-related conversations during the day, it’s in the early hours of the morning that we see a rise of Religion and Philosophy conversations. 虽然人们在白天有更多与旅行相关的对话,但正是在凌晨时分,我们看到宗教与哲学对话有所增加。
Advice on the Rise
While searching for information remains Copilot’s most popular feature, we’ve seen a clear rise in people seeking advice—especially on personal topics. Whether it’s navigating relationships, making life decisions, or just needing a bit of guidance, more users are turning to Copilot for thoughtful support, not just quick answers. This growing trend highlights how digital tools are becoming trusted companions for life’s everyday questions.
Why These Insights Matter
By analyzing high level topics and intents, we manage to learn all these insights while keeping maximum user data privacy. Understanding these patterns helps us make Copilot even better. By seeing what matters most to our users—health, creativity, and support during key moments—we can design features that truly fit into their life. It’s also clear from these uses that what Copilot says matters. They show why it’s so important that we hold ourselves to a high bar for quality.
New audio model snapshots and broader access to Custom Voices for production voice apps.
AI audio capabilities unlock an exciting new frontier of user experiences. Earlier this year we released several new audio models, including gpt-realtime, along with new API features to enable developers to build these experiences.
Last week, we released new audio model snapshots designed to address some of the common challenges in building reliable audio agents by improving reliability and quality across production voice workflows–from transcription and text-to-speech to real-time, natively speech-to-speech agents.
The new snapshots share a few common improvements:
With audio input::
Lower word-error rates for real-world and noisy audio
Fewer hallucinations during silence or with background noise
With audio output::
More natural and stable voice output, including when using Custom Voices
Pricing remains the same as previous model snapshots, so we recommend switching to these new snapshots to benefit from improved performance for the same price.
If you’re building voice agents, customer support systems, or branded voice experiences, these updates will help you make production deployments more reliable. Below, we’ll break down what’s new and how these improvements show up in real-world voice workflows.
Speech-to-speech
We’re deploying new Realtime mini and Audio mini models that have been optimized for better tool calling and instruction following. These models reduce the intelligence gap between the mini and full-size models, enabling some applications to optimize cost by moving to the mini model.
gpt-realtime-mini-2025-12-15
gpt-realtime-mini model is meant to be used with the Realtime API, our API for low-latency, native multi-modal interactions. It supports features like streaming audio in and out, handling interruptions (with optional voice activity detection), and function calling in the background while the model keeps talking.
The new Realtime mini snapshot is better suited for real-time agents, with clear gains in instruction following and tool calling. On our internal speech-to-speech evaluations, we’ve seen an improvement of 18.6 percentage points in instruction-following accuracy and 12.9 percentage points in tool-calling accuracy compared to the previous snapshot, as well as an improvement on the Big Bench Audio benchmark.
Together, these gains lead to more reliable multi-step interactions and more consistent function execution in live, low-latency settings.
For scenarios where agent accuracy is worth a higher cost, gpt-realtime remains our best performing model. But when cost and latency matter most, gpt-realtime-mini is a great option, performing well on real-world scenarios.
For example, Genspark stress-tested it on bilingual translation and intelligent intent routing, and in addition to the improved voice quality, they found the latency to be near-instant, while keeping the intent recognition spot-on throughout rapid exchanges.
gpt-audio-mini-2025-12-15
The gpt-audio-mini model can be used with the Chat Completions API for speech-to-speech use cases where real-time interaction isn’t a requirement.
Both new snapshots also feature an upgraded decoder for more natural sounding voices, and better maintain voice consistency when used with Custom Voices.
Text-to-speech
Our latest text-to-speech model, gpt-4o-mini-tts-2025-12-15, delivers a significant jump in accuracy, with substantially lower word error rates across standard speech benchmarks compared to the previous generation. On Common Voice and FLEURS, we see roughly 35% lower WER, with consistent gains on Multilingual LibriSpeech as well.
Together, these results reflect improved pronunciation accuracy and robustness across a wide range of languages.
Similar to the new gpt-realtime-mini snapshot, this model sounds much more natural and performs better with Custom Voices.
Speech-to-text
The latest transcription model, gpt-4o-mini-transcribe-2025-12-15, shows strong gains in both accuracy and reliability. On standard ASR benchmarks like Common Voice and FLEURS (without language hints), it delivers lower word error rates than prior models. We’ve optimized this model for behavior on real-world conversational settings, such as short user utterances and noisy backgrounds. In an internal hallucination-with-noise evaluation, where we played clips of real-world background noise and audio with varying speaking intervals (including silence), the model produced ~90% fewer hallucinations compared to Whisper v2 and ~70% fewer compared to previous GPT-4o-transcribe models.
This model snapshot is particularly strong in Chinese (Mandarin), Hindi, Bengali, Japanese, Indonesian, and Italian.
Custom Voices
Custom Voices enable organizations to connect with customers in their unique brand voice. Whether you’re building a customer support agent or a brand avatar, OpenAI’s custom voice technology makes it easy to create distinct, realistic voices.
Theese new speech-to-speech and text-to-speech models unlock improvements for custom voices such as more natural tones, increased faithfulness to the original sample, and improved accuracy across dialects.
To ensure safe use of this technology, Custom Voices are limited to eligible customers. Contact your account director or reach out to our sales team to learn more.
From prototype to production
Voice apps tend to fail in the same places, mainly on long conversations or with edge cases like silence, and tool-driven flows where the voice agent needs to be precise. These updates are focused on those failure modes—lower error rates, fewer hallucinations, more consistent tool use, better instruction following. And as a bonus, we’ve improved the stability of the output audio so your voice experiences can sound more natural.
If you’re shipping voice experiences today, we recommend moving to the new 2025-12-15 snapshots and re-running your key production test cases. Early testers have confirmed noticeable improvements without changing their instructions and simply switching to the new snapshots, but we recommend experimenting with your own use cases and adjusting your prompts as needed.
Guys, artificial intelligence has been constantly changing the way enterprises operate. In the past, the emphasis was on intelligent assistants, but they could only respond passively. Now, Agentic AI has arrived, and this is a major evolution 🔥!
Traditional AI assistants can only perform isolated tasks and have limitations. However, Agentic AI can make autonomous decisions, coordinate multi - step actions, actively assess the environment, initiate actions, and coordinate cross - departmental work processes. It's really amazing 👏!
For enterprise leaders, this brings both opportunities and responsibilities. It has great potential, but also poses significant challenges in terms of governance, trust, and design. Enterprises must be able to monitor and reverse the actions of Agentic AI.
Enterprise work processes also need to be re - thought. We can no longer design processes step - by - step and insert automation. Instead, we need to build an intelligent ecosystem, consider which decisions should be made by humans and which by agents, and ensure correct data acquisition.
A unified platform is extremely important at this time. Without it, agents may become disjointed. A unified approach can provide standards, achieve interoperability, reduce complexity, and enable large - scale implementation.
Trust and accountability are also indispensable. Since agents act independently, the risks increase. Trust and accountability need to be integrated from the very beginning, with clear policies to make employees believe that it is a partner.
Enterprises should measure the business value as early as possible and not let projects remain only at the pilot stage. Well - designed Agentic AI can bring exponential improvements and transform enterprise performance.
The rise of Agentic AI is not about handing over power to machines, but a new stage of enterprise transformation where humans and agents fight side by side. Leaders should first conduct pilots and then expand, invest in a unified platform and policy framework, and foster a good culture.
Hey everyone! AI agents are transforming businesses—now is the perfect time for business leaders to step up and shine 💪!
Keywords
#Agentic AI #Enterprise Transformation #Work Process Remodeling #Unified Platform #Trust and Accountability
Guys, the annual blockbuster report on the consumer - grade AI market recently released by a16z, a top venture capital firm in Silicon Valley, is really mind - blowing! 🔥 The competition in the general AI assistant track is extremely fierce right now. Users usually only choose one main product, and the "winner - takes - all" pattern is accelerating.
The report shows that although the usage rate of AI has increased, users' willingness to use it across platforms is extremely low. Take ChatGPT's weekly active users as an example. Less than 10% of them will use other AI services simultaneously. Among mainstream products, only about 9% of users will pay for multiple assistants.
Currently, OpenAI is still remarkable, leading with 800 - 900 million weekly active users. However, its "super - app" strategy faces challenges. Google, with its "experimental field" model, has made Gemini catch up rapidly. The number of desktop users has increased by 155% year - on - year, and the growth rate of paid subscriptions is nearly twice that of ChatGPT. 👏
Judging from the data, ChatGPT has a leading user volume and high user stickiness. The ratio of daily active users to monthly active users is twice that of Gemini. But Gemini is growing at an astonishing rate, especially in terms of the growth of paid users, leaving ChatGPT far behind.
In terms of product strategies, OpenAI is like building a "walled garden", stuffing various functions into ChatGPT, but this makes the interface more complex. Google, on the other hand, adopts the "experimental field" model, allowing innovative products to develop independently, but its products are a bit scattered.
Other players also have their own unique skills. 👍 Anthropic's Claude focuses on technical users, and its programming assistant generates considerable revenue. Perplexity serves non - technical groups who value efficiency. Elon Musk's xAI product Grok is growing extremely fast, and its function iteration is also remarkable. It is said to be the AI product with the fastest - evolving capabilities.
The key to the future competition of AI assistants lies in who better understands users' needs and can transform them into good business models. Guys, who do you favor more? 🤔