- cross-posted to:
- linux@programming.dev
- cross-posted to:
- linux@programming.dev
https://www.youtube.com/watch?v=qHepKd38pr0
One more step towards the discovery of what androids dream about.
God fuck them. They are hunting for hype and not making linux better for the masses
Thanks Canonical…I’ll just throw it in the pile with all the other “wonderful” things you’ve made. It can go on the shelf next to Mir.
maybe it will lead to better accessibility tooling. This is obviously rather silly as a default mode of operations, especially if you imagine people in a crowded office all yelling at their computers.Being real, this is why I fucking hate the bullshit, corporate greed hype of LLMs and generative software. All the “bubble” shit? It tars all versions of the technology with the same brush.
This? This is exactly what it should be used for. And, ffs, earlier speech to text was really the same fucking thing in essence. Software that took input in the form of voice, compared it to a set of data, and made a best guess at what you meant. Yeah, the details are different, but it’s the same concept.
This? This is fucking awesome. Locally run, and doing a job that’s vital in accessibility, with the side benefit of being useful to others. Assuming canonical is being honest anyway.
But this kind of thing should be the way things are done.
That’s why we should not call everything AI
But it is AI, so it should be called that. People should adjust their simplistic notions about the term instead
This would have been better received if they just didn’t use AI in the name. Sure, it’s just using an LLM under the hood, but it’s running purely locally. It also betters Linux since it helps address an accessibility issue.
Canonical can take its AI and walk into the sea
You don’t like accessibility?
Not for that price.
The “price” of a free offline speech to text AI model? Three of them, actually, to work with varying levels of compute resources available?
You anti-AI folks are friggin’ ridiculous.
I don’t think it’s anti-AI more a lack of trust of services saying here’s a product…and the concern it will be used for ulterior motives. I know I don’t like my voice being captured.
It’s captured when you press a button and only handled locally. This is exactly the sort of thing you want for accessibility. Not everybody can type or type well.
I agree. It’s a great use. I just can’t trust any tech company now. Even if they mean well abuses happen.
I admit I don’t know the details, but the title makes it seem like there is a “product” there, by a “company”, probably in it for the profit. And since there is a huge problem with datacenters as it is, why would we encourage more? Most of you AI enthousiasts are blindly walking us into a pit of regret.
I admit I don’t know the details, but the title makes it seem like there is a “product” there, by a “company”, probably in it for the profit.
You don’t know more than the details - you don’t know anything about it.
And since there is a huge problem with datacenters as it is, why would we encourage more? Most of you AI enthousiasts are blindly walking us into a pit of regret.
Guess you’ll want to research what “offline” means. I doubt you have any idea what any problems with datacenters are either given your… we’ll call it “knowledge” of the situation.
Nothing AI is free. Unless there’s a chain of custody for all of the training data, it’s still unethical even if it’s used for a good thing. If I build a wheelchair ramp out of the flesh and bones of orphans I’m still not a very good person. And there are non-AI ways to accomplish this that are just as good that would require almost comically less resources.
This attitude is why Ubuntu, and only Ubuntu, recommends a minimum of 6 GB of Ram btw. You can run a full KDE system with onboard graphics and all the bells and whistles for less than 2 GB on other distros.
Nothing AI is free. Unless there’s a chain of custody for all of the training data, it’s still unethical even if it’s used for a good thing.
This is the weirdest sort of AI bullshit I keep coming across.
And there are non-AI ways to accomplish this that are just as good that would require almost comically less resources.
And… Where are they?
This is the weirdest sort of AI bullshit I keep coming across.
Hi this must be your first time on Earth in the last decade, every single AI company has been in or is currently in no less than ten dozen lawsuits over copyright infringement. It’s so bad there’s at least one website purpose built to track copyright infringement from AI companies..
Without a specific chain of custody for every piece of training data going into the models, there is a default that the model cannot be trusted and is likely infringing on someone’s copyright.
To specify the 'nothing AI is free" part, LLMs are grossly computationally inefficient. Whether it’s local or not.
And… Where are they?
Already installed on most distros.
Fuck AI.
Well “reasoned” argument there. You’ll go far in MAGA.
The framework split things into two groups, implicit AI that quietly improves what you already use and explicit AI that are features you’d actually summon on purpose.
The very first paragraph already upsets me. Have in mind, I would criticize this on every other operating system too. I believe no one should use Ai tools that act autonomously in the background, to improve or change what you already use. It should always be a “summon on purpose”.
Offline-only speech-to-text, integrated with the desktop for push-to-talk voice typing? That’s the kind of AI that I’d like to see. Actually add features that can help people without harming their rights. I’m still moving new machines to Debian but this is nice.
yeah but this kinda adds climate changes cuz more PCs warm up
Also how is speech to text AI? It has existed for decades, obviously a lot better now but I don’t think I’d consider it “AI”
There’s been ML and non-ML ways of doing STT over the years. as far as I recall. The current best implementations are ML-based. In coloquial terms ML algorithms are AI. We used to call them AI in the 2010s, before AI was (un)cool.
I have been using a speech-to-text AI system the last day, and I’m using a whisper large 3 turbo and a rewording model that fixes the sentence but doesn’t rewrite it, and it’s almost perfect. I’m using European-hosted AI through cortecs.ai, and it’s really cheap.
Bit of a click baity title lol. If there’s one good use of AI, its probably accessibility.
Yeah, “Ubuntu wants you to use their new feature” is… unsurprising. Explaining the benefits and purpose of that feature? Now you’re talking.
Implicit optional features to use local LLMs for STT is something that I think most reasonable people could get behind. Too many accessibility tools for the disabled sit behind paywalls and subscription models.
My grandma used to hate tech until she learned to use the voice assistance on her mobile phone. It unlocked the phone for her because she doesnt have the dexterity to type. I hope one day this tool could get to that same point.
Out of curiosity would it be accurate to call this sort of technology generative ai, or just machine learning? Or it depends on the implementation?
I feel like most of the anger around ai is because gen ai has a bunch of harmful baggage, and I’m curious if this is an example of gen ai having a productive use case, or an example of ai being more useful outside of gen ai specifically
This is genai
Based on their explanation, this is still using genAI. They talk about pre-processing the data and chunking it before it’s sent to the inference model.
That’s just how LLMs read data, it could just be for a text search. The problem is where that data came from, if they’re outputting text from it, if they’re getting people to trust that output, and if they’re getting kickbacks from Nvidia for it.
Erm, no.
Come on.
Offline-only is privacy-respecting. Accessibility is a noble goal.
All in all, if there’s an AI usecase that’s as morally acceptable as it gets, it’s this one.
I get that it’s Ubuntu of all people, but even Big Tech produces some ideas every now and then that FOSS lovers can get behind and democratize!
More AI stuff as usual, waiting to see demonstration how this will work


















