BLUE

Dr. Johnathan Flowers, Blade Wielding Bisexual (Lordean arc)

@shengokai.bsky.social

Martial artist, motorcyclist, pragmatist, comics philosopher, queer phenomenologist. Assistant Professor @CSUN. Japanese philosophy, race, gender, disability, tech/AI. Inquiries: johnathan.flowers@csun.edu Rider of the mind shaitan. He/Him/His 🏳️‍🌈

8.5k followers1.2k following10.5k posts

DJshengokai.bsky.socialSep 18, 2024 11:23pm

Come here, y'all, and let me tell you how this gets so much worse. So, I think it was either in 2017 or 2018 that a bunch of scholars demonstrated that BERTs and GPTs deployed in automated moderation labeled tweets from Black users as "rude" 50% more often than tweets from white users.

EBerinbiba.bsky.socialSep 18, 2024 8:28pm

Raise your hand if you've had the "rude" label applied to you while defending your marginalized identity against an abusive person 🙋🏻‍♀️ The "rude" label should not exist. It is *subjective* and easily weaponized by bigots and bad actors. Suspension based on rude labels is going to CAUSE HARM.

QJquiet-julia.bsky.socialSep 19, 2024 12:53am

For instance, I received a three day ban instead of a total ban on Reddit for the following after the first assassination attempt occurred. “I was extremely disappointed to hear that the person who attempted to assassinate the 🍊🤡 failed in his endeavour.”

QJquiet-julia.bsky.socialSep 19, 2024 12:49am

I think the trick here is to blast them so eloquently that BSky can’t place a rude label on you. Replies like that are what I constantly strive to accomplish.

DJshengokai.bsky.socialSep 18, 2024 11:24pm

The authors of the study concluded that the nature of the LLMs (and this is pre-Stochastic Parrots) meant that they could not understand the context of the speech used and were subsequently making moderation decisions on the basis of what we in philosophy would call "received norms" of conduct.

Jjucifer.bsky.socialSep 18, 2024 11:50pm

📌

Dr. Johnathan Flowers, Blade Wielding Bisexual (Lordean arc)

@shengokai.bsky.social

8.5k followers1.2k following10.5k posts