I felt that when he said *83h400+93)*38hpfhi0
The thing I find funniest about this post is that you call this Italian
Blud could’ve chosen Runic, Egyptian, the Ancient Romanian used by Vlad the Impaler, Mesopotamian or even Harappan Indic. But Italian it is.
Blud I’m gonna be fr no cap rn but wtf does blud mean I’ve been meaning to ask for months and I still don’t get it
It’s Jamaican slang for ‘friend’ or ‘brother’.
Thanks blud.
how am i supposed to know how italians speak. i’ve never seen one
It’s a me, Mario!
They’re not real, but they can hurt you.
like reverse vampires ?
Ne sei sicuro? (Italian for “Are you sure?”)
That’s right! None of us knows how Italians can speak in the dark 🤌
From my experience, they speak mostly with their hands
🫰🤙🫵👌✊🫳🫸🤲🤌
Prego (“you’re welcome”)
Ditto
Typical 'muricans being unable to comprehend anything besides English.
/s i don't mean to be racist
yes i was a r/2we4u user, how’d you know?
Well, it certainly doesn’t overflow on 32 bit systems
Never go full APL
Ah, I see you’re using FartGPT instead of ChatGPT
French pronunciation intensifies
Cat, I farted.
is that the new model ?
Wow, an alien ion drive formula! Try to get warp drive out of it too!
Which language uses these signs? It truly looks like some kind of alien language
Unown
Glagolitic script. Oldest known Slavic alphabet according to Wikipedia.
They should revive this script. I like it more than Cyrillic.
I found it! It’s the Glagolitic script, used in the 9th century before Cyrillic took over:
ⰀⰁⰂⰃⰄⰅⰆⰇⰈⰉⰊⰋⰌⰍⰎⰏⰐⰑⰒⰓⰔⰕⰖⰗⰘⰙⰚⰛⰜⰝⰞⰟⰠⰡⰢⰣⰤⰥⰦⰧⰨⰩⰪⰫⰬⰭⰮⰰⰱⰲⰳⰴⰵⰶⰷⰸⰹⰺⰻⰼⰽⰾⰿⱀⱁⱂⱃⱄⱅⱆⱇⱈⱉⱊⱋⱌⱍⱎⱏⱐⱑⱒⱓⱔⱕⱖⱗⱘⱙⱚⱛⱜⱝⱞ
I think it’s the Ge’ez script used in Ethiopia.
Doesn’t look like it to me:
ልዩ ጊዜ ነበር። አሁን የሚሆነውን ለማስተዋል የኢንተርኔት አውራጃ ማረጋገጥ ነበር።
Yeah, you are right.
I would like to know too! Never saw that writing system before.
APL?
No, that looks like
⌶⌷⌸⌹⌺⌻⌼⌽⌾⌿⍀⍁⍂⍃⍄⍅⍆⍇⍈⍉⍊⍋⍌⍍⍎⍏⍐⍑⍒⍓⍔⍕⍖⍗⍘⍙⍚⍛⍜⍝⍞⍟⍠⍡⍢⍣⍤⍥⍦⍧⍨⍩⍪⍫⍬⍭⍮⍯⍰⍱⍲⍳⍴⍵⍶⍷⍸⍹⍺
Damn, wild Glagolitic script found. I didn’t even realise it was in the Unicode standard.
That’s not Italian, that’s obviously Unown
It looks so badass. I could have been using that script, since I’m Ukrainian, but instead I have the Cyrillic script, which is so boring
rebel against Russian imperialism, return to Glagolitic
It’s not Russian. If my Bulgarian friend is right, it was created by a Bulgarian guy
There is no single person responsible for the Cyrillic script. It is mostly believed to have been created by the scholars of the Preslav Literary School, which was indeed in Bulgaria, by mixing and adapting the Greek and Glagolitic scripts. Much later, Peter the Great changed it a lot. And then Stalin stamped out almost all the deviations in the usage of the script.
The last part is mostly why it is considered Russian. A lot of languages suffered because of Moscow just forcing them to use the version of Cyrillic that Russians were using.
Cyrillic is literally Greek + Glagolitic, and it was partly a diplomatic creation of the Eastern Roman Empire (aka the Byzantine Empire), meant to bring the Slavs culturally closer to them.
Russians have nothing to do with it, other than claiming they are the continuation of the Eastern Roman Empire, which is kinda laughable, but whatever, don’t let your dreams be dreams.
We are so cooked
Looks like Uiua: uiua.org
This might be happening because of the ‘elegant’ (incredibly hacky) way openai encodes multiple languages into their models. Instead of using all character sets, they use a modulo operator on each character, to make all Unicode characters represented by a small range of values. On the back end, it somehow detects which language is being spoken, and uses that character set for the response. Seeing as the last line seems to be the same mathematical expression as what you asked, my guess is that your equation just happened to perfectly match some sentence that would make sense in the weird language.
I suppose it’s conceivable that there’s a bug in converting between different representations of Unicode, but I’m not buying any of this “detects which language is being spoken” nonsense or the use of character sets. It would just use Unicode.
The modulo idea makes absolutely no sense, as LLMs use tokens, not characters, and there’s soooooo many tokens. It would make no sense to make those tokens ambiguous.
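To make the objection concrete, here’s a toy sketch (not anything OpenAI actually does) of the modulo scheme described above. Folding code points into a small range is lossy, so two different characters become indistinguishable and decoding is ambiguous:

```python
VOCAB = 256  # hypothetical reduced range, purely for illustration

def fold(text):
    # Collapse every code point into [0, VOCAB) -- the "modulo operator
    # on each character" idea described in the comment above.
    return [ord(c) % VOCAB for c in text]

# U+0041 'A' and U+0141 'Ł' land on the same residue (0x141 % 0x100 == 0x41),
# so after folding they can't be told apart:
assert fold("A") == fold("Ł") == [65]
```

Any scheme like this would need extra out-of-band information to pick which character each residue meant, which is exactly the ambiguity problem.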
I completely agree that it’s a stupid way of doing things, but it is how OpenAI reduced the vocab size of GPT-2 & GPT-3. As far as I know (I have only read the comments in the source code), the conversion is done as a preprocessing step. Here’s the code for GPT-2: https://github.com/openai/gpt-2/blob/master/src/encoder.py I did apparently make a mistake, as the vocab reduction is done through a lookup table instead of a simple mod.
Do you have a source for that? Seems like an internal detail a corpo wouldn’t publish
Can’t find the exact source (I’m on mobile right now), but the code for the GPT-2 encoder uses a byte-to-Unicode lookup table to shrink the vocab size. https://github.com/openai/gpt-2/blob/master/src/encoder.py
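For reference, the table in that file isn’t a modulo at all. This is essentially the `bytes_to_unicode` function from the linked encoder.py (lightly commented): it maps each of the 256 byte values to a distinct printable Unicode character so the BPE step never has to deal with raw whitespace or control bytes. Crucially, the mapping is a bijection, so it’s fully reversible and loses no information:

```python
def bytes_to_unicode():
    # Bytes that are already printable, non-space characters keep
    # their own code point...
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    # ...and every remaining byte (controls, space, etc.) is shifted
    # up past 255 to an unused printable code point.
    n = 0
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))

table = bytes_to_unicode()
assert len(set(table.values())) == 256  # bijective: fully reversible
assert table[ord("A")] == "A"           # printable bytes map to themselves
assert table[32] == "Ġ"                 # space becomes the 'Ġ' seen in GPT-2 vocab files
```

This is why GPT-2 token strings are full of characters like `Ġ`: that’s just the escaped form of a space byte, not a different “character set” per language.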
Seriously? Python for massive amounts of data? It’s a nice scripting language, but it’s excruciatingly slow
There are bindings in Java and C++, but Python is the industry standard for AI. The machine learning libraries are actually written in C++ and just expose Python bindings, so Python doesn’t tend to slow things down; machine learning is GPU-bound anyway. There are also library-specific languages that have you write Pythonic code which gets compiled down to C++.
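A minimal, stdlib-only sketch of the “Python as glue over compiled code” pattern described above: the actual work happens in a compiled library, and Python only dispatches the call. Here we bind libm’s `sqrt` via `ctypes` (the `libm.so.6` fallback name is a Linux assumption); ML frameworks do the same thing at far larger scale with C++/CUDA kernels behind their Python APIs:

```python
import ctypes
import ctypes.util
import math

# Locate and load the C math library; "libm.so.6" is a glibc/Linux fallback.
libm = ctypes.CDLL(ctypes.util.find_library("m") or "libm.so.6")

# Declare the C signature: double sqrt(double)
libm.sqrt.restype = ctypes.c_double
libm.sqrt.argtypes = [ctypes.c_double]

# The call crosses into compiled code and comes back with the result.
assert libm.sqrt(9.0) == math.sqrt(9.0) == 3.0
```

The per-call overhead of crossing the Python/C boundary is fixed, so as long as each call does a lot of work on the other side (a matrix multiply, a GPU kernel launch), the interpreter’s slowness barely registers.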