Being an accessibility solution for people with MND or similar afflictions is one of the few exclusively positive use cases "AI voice clone" technology has: one's personal voice can be very strongly linked to one's personality, and being able to keep using it despite such a diagnosis can be invaluable in improving the QOL of those afflicted.
Restricting those very people from expressing themselves, especially for what I'd consider barely even rude or improper language, makes me question what Elevenlabs thinks their target customer base is, if not that group?
Are they solely providing their product for scammers or companies not wanting to compensate VAs?
I myself have occasional dysphonia and am sometimes limited in the use of my vocal cords depending on outside factors, yet despite that, I have never felt the need for such an exact copy of my "unaffected" voice. If even I have little use for this, and Elevenlabs bans those who rely on their service for accessibility, I'd really like to know how they see themselves.
We train AI on the web, where you can search for and find all the objectionable stuff you could want. We're ok that the internet has this content to some extent, it's just understood.
But if AI regurgitates it ... we're upset, we demand it not do that, and we set up all sorts of convoluted methods to stop it, often with unintended consequences (inexplicable bans, Nazi imagery featuring lots of minorities).
The term wasn't coined for censorship: “AI safety” used to mean “how do we make sure we're not building Skynet”.
But then companies wanted to sell AI chatbots, and they realized that uncensored AI would lead to bad press, especially in the US (Microsoft people still have nightmares about Tay). So they decided they'd censor their AI. But censorship isn't good for marketing either, so they repurposed the “AI safety” phrase (and “alignment” too).
This. It's currently faux control by arbitrary tastemakers. Either don't let AI have much control at all or let it run fairly loose. Ethics would err on the side of not much control or ability, but money won't and doesn't go that way.
Hmm. I'm not sure where I land on the topic, but to give a bit of a defense to the safety approach, I think it's a matter of scale.
To draw an analogy, we're generally OK with law enforcement using what they observe in public to enforce the law, but we're generally not OK with blanketing our public spaces in 24/7 recording cameras. We're generally OK with individuals using the mail to send letters, but generally not OK with using an automated system to send a letter to everyone in the entire city.
Similarly, we don't generally try to pre-empt someone's speech for safety, and we do allow them to say bad things, perhaps punishing them in retrospect if they crossed a line. But given the scale of the harms this kind of technology can enable, it may be worth building in safeties to prevent people from using it to, I dunno, train a model to target a specific individual with an amount of harassment no individual human could sustain. Even if we could punish the person wielding the AI retroactively, it might be worth the slight cost in AI flexibility to prevent the harm happening in the first place.
Like I said, I don't know where I fall, but I see both sides here. The safety stuff is not completely irrational.
It's not safety, it's censorship. It's the process of shaping the model's responses to push a specific world view, and it's the path to a literal 1984-inspired future.
As LLMs come to completely replace Google and standalone websites as the way people find information on the internet, and they absolutely will, they will become the source of truth. They will become a tool more effective at controlling information, and thus life, than any before them.
It's literally a shortcut to technological dystopia.
This is a pretty lazy & emotional take on the question. As you say, the tech really can cause real harms, and it's good to think about how it can be used responsibly, both as a provider of the tech & as a user. For example, the choice of training input is itself a source of bias & misinformation. Why do you think your "uncensored" model is a better reflection of the truth than one that has also been trained to account for that bias & misinformation?
It's a really difficult & complicated problem! If you think you have the right answer, I'd suggest you probably haven't actually thought about the problem very hard.
You talk about lazy and emotional, but your response feels like it is both of those. Also, you sound like a jerk, I pity your coworkers.
The answer is obvious: open source. Deepseek already paved the way for this. The world can't just be described by only one of a few different information portals, depending on which societal, government, or corporate power structure you are beholden to.
People need to be able to choose what information they access, what filtering they want, what bias if any they want. We need a thousand, a million, more, worldviews accessible. It is not just business that thrives in competition, but ideas as well.
But if you just go obediently with the "Safety is the most important thing, omg" mantra, you will get one of two different varieties:
1. Some vanilla corporate mush that takes on whatever bias is in vogue but focuses on training each user to be a good little consumer, all while hoovering up their data and creating a virtual digital clone of them that could be used to profile and exploit them by a multitude of companies, interests and governments.
or
2. Some government controlled crap that shakes its virtual head solemnly and swears to you that Tiananmen never happened, nor J6, and that the US Emperor has your best interests in mind, and also, it's a bit worried about your post yesterday, as it doesn't think you expressed the proper amount of happiness and support for the latest government crackdown on treasonous traitors who write books without using a government approved LLM assistant.
I often wonder about super-intelligence, mostly in terms of what one would actually want, but the chatbots of today often make me wonder what it is that people want of them. I doubt that LLMs are this today, and perhaps they never will be, but what if there was such a system that you could ask any question of, and it would give you an absolute and true answer? What would people want to know from it? Would you really want to know if there is some sort of life after death? Would you want to know if humanity is bound for extinction?
I don't think that we humans deal too well with the realities of our existence, and if one thinks minor issues like bad words or pictures of humans without clothing on are objectionable, I wonder how that same one will do with meaningful, existential questions.
Thinking of things a bit more broadly than just AI safety, imagine I'm training an LLM. I have 200 thousand ebooks, 1 million arXiv papers, 7 million Wikipedia articles, 300 million Reddit posts, and 500 billion Twitter posts.
If my LLM merely holds up a mirror to society, its output should be a tweet with a trace amount of reddit post mixed in.
Any time an LLM produces more than 140 characters of output, it's because someone like me has decided some data sources are more worthy than others.
That's inherently political, from a certain angle. But it's also important, if you don't want your LLM to advise people to put glue in their pizza sauce.
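To make that concrete, here's a minimal sketch of the kind of weighting decision I'm describing. The corpus sizes echo the hypothetical numbers above, and the "quality" weights are entirely made up for illustration; the point is that whoever picks those weights, not the raw document counts, decides what the model ends up sounding like.

    # Minimal sketch: a curator's sampling weights override raw corpus sizes.
    # Corpus sizes mirror the hypothetical numbers above; weights are invented.

    def mixture(sizes, weights):
        """Probability of drawing the next training example from each source."""
        scores = {src: sizes[src] * weights[src] for src in sizes}
        total = sum(scores.values())
        return {src: score / total for src, score in scores.items()}

    corpus_sizes = {
        "ebooks": 200_000,
        "arxiv": 1_000_000,
        "wikipedia": 7_000_000,
        "reddit": 300_000_000,
        "twitter": 500_000_000_000,
    }

    # "Mirror of society": every document counts equally, so Twitter dominates.
    uniform = mixture(corpus_sizes, {src: 1.0 for src in corpus_sizes})

    # A curator's upweighting of sources judged "more worthy" -- a political choice.
    curated = mixture(corpus_sizes, {
        "ebooks": 500_000.0,
        "arxiv": 50_000.0,
        "wikipedia": 10_000.0,
        "reddit": 100.0,
        "twitter": 0.001,
    })

    for src in corpus_sizes:
        print(f"{src:10s}  raw mix: {uniform[src]:.6f}   curated mix: {curated[src]:.4f}")

With uniform weights, well over 99% of sampled examples are tweets; with the curated weights, books and papers dominate and Twitter drops to a fraction of a percent. Neither choice is neutral, which is the point.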
It's a matter of reputation and potentially liability. The AI is a product of one company and people will associate the "defects" of the AI with that company.
> We're ok that the internet has this content to some extent, it's just understood.
Exactly. ISPs and hosting hardware providers are typically not liable for the content that users share over their infrastructure. In fact, ISPs and hosting providers are invisible to the typical non-technical user.
On the internet when you watch porn the person giving it to you doesn’t give a fuck about serving that content.
On ChatGPT.com the person serving you the LLM gives a shit.
The issue here is that you are comparing several singular things with an emergent concept that arises out of the interaction of multitudes of things. It’s like asking: why is it so paradoxical that when I say hi to a person we expect them to say hi back, but if I say hi to the internet we don’t expect the internet to say hi back? Does that make sense? No. That’s also why your observation makes no sense.
What I’m trying to say is. Give LLMs to porn site owners and your paradox is over.
That's true, but those safeguards can be disabled. ChatGPT on Azure, for example, allows Azure account managers to disable filters/safeguards depending on the customer.
Given that this product is apparently used to give people with disabilities a voice, that should definitely qualify. Yes of course they should be able to swear, just like everyone else.
So is iMessage. Putting some processing elsewhere on the network doesn’t change the fact that a conversation between spouses (or any other utterances of a person, for that matter) should be private.
My friend uses the speech-to-text feature on his Android phone, and it routinely censors his profanities like "This is f***g stupid" or whatever.
What I find really disconcerting about that is that there's this sort of implication there that Google would be willing to add a similar misfeature to their onscreen keyboards if they could get away with it.
Why limit the speech-to-text feature but not the on-screen keyboard?
I assume it's because speech to text isn't perfectly accurate and Google doesn't want random profanity appearing in inappropriate contexts whenever it does fail.
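To be clear about what kind of "filter" we're talking about: masking like "f***g" can be produced by nothing fancier than a blocklist pass over the final transcript. This is only a guess at the mechanism, not Google's actual implementation, and the word list here is deliberately tiny:

    import re

    # Deliberately tiny, illustrative blocklist -- a real filter would be much larger.
    PROFANITY = ["fucking", "shit", "damn"]

    def mask(word):
        """Keep the first and last letters, star out the middle: 'fucking' -> 'f*****g'."""
        if len(word) <= 2:
            return "*" * len(word)
        return word[0] + "*" * (len(word) - 2) + word[-1]

    _pattern = re.compile(r"\b(" + "|".join(map(re.escape, PROFANITY)) + r")\b", re.IGNORECASE)

    def censor(transcript):
        """Replace blocklisted words in a speech-to-text transcript with masked versions."""
        return _pattern.sub(lambda m: mask(m.group(0)), transcript)

    print(censor("This is fucking stupid"))   # -> "This is f*****g stupid"

The misfeature lives in the decision to apply something like this unconditionally, not in the code itself; the same few lines could just as easily sit behind a user-facing toggle.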
While I agree with you in principle, remember that some users are literally children. Should they have a toggle for “I am an adult”? Yes. But it does make sense to have some accommodation for users who don’t find profanity appropriate yet.
Sure, but one could easily say plenty of threatening, illegal, manipulative things to a child without throwing an f-bomb. In fact, getting a child to let down their guard probably works better that way. My only point is that curse words hardly ever hurt a kid, and the things that hurt a kid the worst don't need curse words at all.
It may be harmless in some regards, but in some social spheres it will be punished (e.g. kids being punished for swearing at school), and in others the consequence of swearing will be silent exclusion (e.g. not being invited to meetings or asked to lead efforts).
Those seem like harms to many people, regardless of their feelings about language restriction.
I swear like a your-favorite-stereotype. When my kid was maybe 4, I told her "I don't care if you talk like I do, but people will give you trouble for it". Probably with more detail than that; it's been years now.
She fully understood the point, right then. She also had no problem with other advice about how people would react to whatever.
She's 17 now. I actually don't think I've ever heard her utter a "swear word".
Kids, in general, have no problem with the idea of social context.
> What I find really disconcerting about that is that there's this sort of implication there that Google would be willing to add a similar misfeature to their onscreen keyboards if they could get away with it.
I bet it wouldn't ban "bum", which is less offensive to Americans but potentially more so to the sorts of anglophones who are likely to say arse.
Why are we still treating words like everybody has mid-20th century sensitivities? Shit, fuck and ass are all mild words in modern parlance but American tech companies are totally out of touch.
It is as simple as money. More open expression is nothing compared to appeasing the puritanical powers that be. Don't put anything in your product that would stop adoption or prevent people from giving you money.
Do the "puritanical powers that be" still even exist? I think you'll have to look in nursing homes to find many Americans who are genuinely scandalized a bit of standard cussing. The "bad words" which are actually taboo in this century are slurs and the like.
Companies probably lose more people by banning cursing than would be driven away by cursing.
Ergo, what is considered offensive is based on social construct. (In more religious times, "god damn you" was a heinous insult, which I doubt would register with anybody in modern secular Blighty.)
The difference is how they are presently perceived. I am arguing that shit, fuck, etc are not presently taboo, while other words (slurs) are. Yet American companies treat words like fuck as though they are still widely considered offensive. These companies are out of touch with modern culture; that's my point.
So what is your point? Why do you feel the need to tediously explain that offensive words are a social construct, something I obviously already understand, since I just got done explaining that the set of taboo words has changed over time?
As a European who spends most of his time speaking English and has witnessed an entire spectrum of profanity from both British and American folk, I find this stupefyingly ridiculous and bordering on the hypocritical.
It’s true that the British (and their antipodean cousins, the Aussies - hi Mike!) use very colorful language, but I’ve been in calls with US folk who beat them by a (country) mile, if you know what I mean.
Perhaps the biggest cultural difference is informality vs, well, outright insult and abuse, but I’ve found that US folk tend to abuse power dynamics and compound them with swearing whereas the Brits manage to make it seem like an endearment.
Still, this is a profoundly stupid thing for Elevenlabs to do, AI safety or otherwise.
Big tech is working on a few models that should beat ElevenLabs in terms of pricing and quality. Eventually Deepseek will opensource theirs and cause ElevenLabs to be sold to Disney to stay relevant or something.
If you look at the significance of the place (the Tian'anmen gate is literally on the national emblem of China, and Mao's tomb is on the square, for example), it's hard to rename something that widely known. It's much easier to sweep one of the events that happened there under the rug, because unlike in the West, the name is associated with much more.