You can already do that on device even with whisper, though without swapping only tiny/small models, maybe with 8gb ram medium comes into play. Not sure what you expect from KBs? Science is never settled, so if you get proponents of one theory to build KB of course it will contradict KB built by guys who support different one, there is no objective truth lol
Every knowledge base, if the knowledge is developing must be curated and every source can be biased - it’s not a technical problem.
Edit: What I ideally expect from a knowledge base is that I can ask questions by voice about facts I entered and get audio responses.
Is it developing when the current mainstream knowledge blacklists people trying to upend it, or is it stagnating and keeping the status quo, will you load the einstein KB when you have aether KB already and all MSM says aether real?
I’m not getting my news or relevant political/historical/societal facts from next-word-predictor AI (e.g. ChatGPT), which as we all should know can hallucinate and are also heavily biased towards western hegemony and in other ways IMO.
What you’re talking about is not a problem of AI in general. It’s a matter of media competence.
No, it’s a problem of just getting facts from pre-approved source, where ‘experts’ vow to be right, you will not get any new knowledge with aether KB, why even query it lol
Yeah, that’s why i don’t.
Edit: You mentioned whisper. If i’m not mistaken, this is only speech recognition and therefore only part of the heavy lifting an AI assistant needs to do. From mind 2 i expect an SDK with a more complete AI framework.
Whisper is stt (with translation to english for free) from like 50+ languages, I’m sure there are open tts’s with similar results? Not sure, check out speech note on openrepos it uses a lot of different backends, so some might be two way, and again for M2 details, sell your soul and phone number to discord
“I’m sure there are open tts’s with similar results?”
Last time I checked, around 3 or 4 years ago on Nvidia Xavier AGX (with >30 TOPS vs. 6 TOPS of Mind2 ), they were so slow i didn’t want to pursue it further. Quality was acceptable.
Discord only has my email? Do they force mobile accounts now?
No clue, keep us posted
I would assume language models were a lot less efficient those “3 or 4 years ago”, so you probably need a lot less computational power to achieve the same result nowadays.
And Discord doesn’t necessarily require a phone number, only when certain “anti-bot” measures are being triggered due to certain markers being deemed (too) suspicious, from my experience.
I’m hoping for the same, but I would like to see some more hard facts before I decide to contribute (via pre-order) to the project.
Naive question: For what shall this Mind2 be good? What can a housewife do with it? Is it a kind of NAS or router? What does this device?
One simple idea: detect scheduled events in my emails and put them in my calendar (similar to the Android/iOS feature), act as an automatic personal information manager (e.g. aggregating contact data)
I forgot one use case: prepare automatic email replies (chatGPT-like: please draft a kind response to the email excusing that I will not attend to the event I am invited to (or confirming that I will attend - and place the event in the calendar))
Is this usable as spam filter?
It is advertised as being able to
You should be able to “talk” (only writing? i don’t know) to it in human language and can tell it to do stuff in these categories.
I can imagine there will be a spam handling agent (i.e. a spam filter)
Yeah, but the advertised capabilities are all rather unspecific
have you eard of pi-hole? this is a raspberry pi as an adblocker, i guess it acts as a proxy server. i thought one can use mind2 as this, but for that alone it would be very expensive. its main purpose is to let a large language model be run locally and use sensitive data that you don’t want to give an llm from microsoft or google
Is this written text or speech recognition? Does it have a microphone? Is it similar to something like “Alexa”?
i think you can modify it to act like alexa, but i don’t think it comes with a microphone, although one should be able to add one. i see it as an asic for llms