The catarrhine yerba mate enjoyer who invented a perpetual motion machine, by dreaming at night and devouring its own dreams through the day.

Who do you think you are, War?

  • 3 Posts
  • 144 Comments
Joined 3 years ago
Cake day: April 9th, 2021



  • The source that I’ve linked mentions semantic embedding; so does further literature on the internet. However, the operations are still being performed with the vectors resulting from the tokens themselves, with said embedding playing a secondary role.

    This is evident for example through excerpts like

    The token embeddings map a token ID to a fixed-size vector with some semantic meaning of the tokens. These brings some interesting properties: similar tokens will have a similar embedding (in other words, calculating the cosine similarity between two embeddings will give us a good idea of how similar the tokens are).

    Emphasis mine. A similar conclusion (that the LLM is still handling the tokens, not their meaning) can be reached by analysing the hallucinations that your typical LLM bot outputs, and asking why each hallucination is there.
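To make the cosine similarity bit from that excerpt concrete, here's a minimal sketch. The 3-dimensional vectors are made up by me for illustration; real token embeddings are learned and have hundreds or thousands of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical embeddings, invented for this example only.
embeddings = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "carburettor": [0.0, 0.1, 0.9],
}

# Vectors pointing in a similar direction score close to 1.
print(cosine_similarity(embeddings["cat"], embeddings["kitten"]))
# Vectors pointing in unrelated directions score close to 0.
print(cosine_similarity(embeddings["cat"], embeddings["carburettor"]))
```

Note that the score only compares the vectors attached to the tokens; it tells you nothing about which meaning of an ambiguous token is intended in a given sentence.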

    What I’m proposing is deeper than that. It’s to use the input tokens (i.e. morphemes) only to retrieve the sememes (units of meaning; further info here) that they’re conveying, then discard the tokens themselves, and perform the operations solely on the sememes. Then for the output you translate the sememes obtained by the transformer into morphemes=tokens again.
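A toy sketch of the two ends of that pipeline (tokens in, sememes out, then sememes back into tokens). Everything here is hypothetical and hand-written by me: the lexicon, the sememe names and the hint table. The transformer step in the middle is left out entirely, and a real system would need to learn these mappings rather than hard-code them.

```python
# Hypothetical lexicon: surface token -> sememes that it may convey.
TOKEN_TO_SEMEMES = {
    "bank": ["FINANCIAL_INSTITUTION", "RIVERSIDE"],
    "shore": ["RIVERSIDE"],
    "river": ["RIVER"],
    "loan": ["LOAN"],
}

# Hypothetical hints: a sememe is preferred when one of these is in the context.
SEMEME_HINTS = {
    "FINANCIAL_INSTITUTION": {"LOAN"},
    "RIVERSIDE": {"RIVER"},
}

# Hypothetical preferred surface form for each sememe, used for the output.
SEMEME_TO_TOKEN = {
    "FINANCIAL_INSTITUTION": "bank",
    "RIVERSIDE": "shore",
    "RIVER": "river",
    "LOAN": "loan",
}


def tokens_to_sememes(tokens):
    """Retrieve one sememe per token; afterwards the tokens can be discarded."""
    # Sememes of unambiguous tokens serve as context for the ambiguous ones.
    context = {
        TOKEN_TO_SEMEMES[t][0] for t in tokens if len(TOKEN_TO_SEMEMES[t]) == 1
    }
    sememes = []
    for token in tokens:
        candidates = TOKEN_TO_SEMEMES[token]
        # First candidate whose hints overlap the context, else the first one.
        chosen = next(
            (c for c in candidates if SEMEME_HINTS.get(c, set()) & context),
            candidates[0],
        )
        sememes.append(chosen)
    return sememes


def sememes_to_tokens(sememes):
    """Translate sememes back into surface tokens for the output."""
    return [SEMEME_TO_TOKEN[s] for s in sememes]


print(tokens_to_sememes(["river", "bank"]))
print(tokens_to_sememes(["loan", "bank"]))
print(sememes_to_tokens(tokens_to_sememes(["river", "bank"])))
```

The point of the toy: the same token "bank" ends up as two different units once you work on the sememe level, so whatever operations come next never see the ambiguous token at all.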

    I believe that this would have two big benefits:

    1. The amount of data necessary to “train” the LLM will decrease. Perhaps by orders of magnitude.
    2. A major type of hallucination will go away: self-contradiction (for example: states that A exists, then that A doesn’t exist).

    And it might be an additional layer, but the whole approach is considerably simpler than what’s being done currently - pretending that the tokens themselves have some intrinsic value, then playing whack-a-mole with situations where the token and the contextually assigned value (by the human using the LLM) differ.

    [This could even go deeper, handling a pragmatic layer beyond the tokens/morphemes and the units of meaning/sememes. It would be closer to what @[email protected] understood from my other comment, as it would then deal with the intent of the utterance.]


  • Not quite. I’m focusing on chatbots like Bard, ChatGPT and the likes, and their technology (LLM, or large language model).

    At the core those LLMs work like this: they take words, split them into “tokens”, and then perform a few operations on those tokens, across multiple layers. But at the end of the day they still work with the words themselves, not with the meaning being encoded by those words.
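A rough sketch of the "split them into tokens" step, assuming a tiny made-up vocabulary and a greedy longest-match rule. Real LLM tokenizers use learned vocabularies (byte-pair encoding and similar schemes), so take this only as an illustration of the idea.

```python
# Hypothetical vocabulary: sub-word piece -> token ID.
VOCAB = {"un": 0, "believ": 1, "able": 2, "cat": 3, "s": 4}


def tokenize(word, vocab):
    """Split a word into token IDs, always taking the longest known piece."""
    ids = []
    start = 0
    while start < len(word):
        # Try the longest remaining piece first, then shrink it.
        for end in range(len(word), start, -1):
            piece = word[start:end]
            if piece in vocab:
                ids.append(vocab[piece])
                start = end
                break
        else:
            raise ValueError(f"no token covers {word[start:]!r}")
    return ids


print(tokenize("unbelievable", VOCAB))
print(tokenize("cats", VOCAB))
```

Everything downstream only sees those IDs (and the vectors attached to them), which is why I say that the model is working with the words, not with what they mean.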

    What I want is an LLM that assigns multiple meanings for those words, and performs the operations above on the meaning itself. In other words the LLM would actually understand you, not just chain words.


  • Complexity does not mean sophistication when it comes to AI and never has and to treat it as such is just a forceful way to make your ideas come true without putting in the real effort.

    It’s a bit off-topic, but what I really want is a language model that assigns semantic values to the tokens, and handles those values instead of directly working with the tokens themselves. That would probably be far less complex than current state-of-the-art LLMs, but way more sophisticated, and require far less data for “training”.








  • It’s less complicated than it looks. The text is just a poorly written mess, full of options (Fedora vs. Ubuntu, repo vs. no repo, stable vs. beta), and they’re explaining how to do this through the terminal alone because the interface that you have might be different from what they expect. And because copy-pasting commands is faster.

    Can’t I just download a file and install it? I’m on Ubuntu.

    Yes, you can! In fact, the instructions include this option; it’s under “Installing the app without the Mullvad repository”. It’s a bad idea though; then you don’t get automatic updates.

    A better way to do this is to tell your system “I want software from this repository”, so each time that they make a new version of the program, yours gets updated.

    but I have no idea what I’m doing here.

    I’ll copy-paste their commands to do so, and explain what each does.

    sudo curl -fsSLo /usr/share/keyrings/mullvad-keyring.asc https://repository.mullvad.net/deb/mullvad-keyring.asc
    echo "deb [signed-by=/usr/share/keyrings/mullvad-keyring.asc arch=$( dpkg --print-architecture )] https://repository.mullvad.net/deb/stable $(lsb_release -cs) main" | sudo tee /etc/apt/sources.list.d/mullvad.list
    sudo apt update
    sudo apt install mullvad-vpn
    

    The first command boils down to “download this keyring from the internet”. The keyring is a necessary file to know if you’re actually getting your software from Mullvad instead of PoopySoxHaxxor69. If you wanted, you could download it manually, and then move it to the /usr/share/keyrings directory, but… it’s more work, come on.

    The second command tells your system that you want software from repository.mullvad.net. I don’t use Ubuntu but there’s probably some GUI to do it for you.

    The third command boils down to “hey, Ubuntu, update the list of packages for me”.

    The fourth one installs the software.


  • Lvxferre@lemmy.ml to Linux@lemmy.ml: *Permanently Deleted* · 6 months ago

    The first response seems reasonable to me; it’s informative and replying to an ambiguous comment, as you can’t quite know if “isn’t there” refers to his individual needs or in general.

    The second response is however passive aggressive garbage. Fl4ppers clarified that he was talking about his individual needs; notjustforhackers failed to take it into account, and his response sounds a lot like “I’m just sayin lol lmao… you liar”.




  • Then join an instance where scores are disabled if you don’t like them. :shurg:

    Already addressed - a lot of those issues will still affect you, even if you don’t use the karma system.

    Let’s say that instances A (karma disabled) and B (karma enabled) federate. A users won’t get the karma system itself, but they’ll still get: less varied and less interesting content, stronger echo chambers, and higher concentration of users in oversized and unruly comms. That’s because they use the same comms as the B users, and thus the behaviour of B users affects A.

    Choosing an instance where downvotes are disabled is already a preference, so making the score aggregates optional is completely in line with that.

    Downvotes are a mixed feature, with pros and cons.
    Karma looks good from a distance, but upon closer inspection it’s only cons. (Including enabling shitty=assumptive mod practices.)

    You’re already on .ml, so…

    I am clearly not talking about my individual usage here. I’m talking about users in general and the Lemmyverse as a whole.

    The whole shtick of Lemmy is run your instance the way you want to run it.

    I’m not sure what’s supposed to be the [ipsis digitis] “whole shtick of Lemmy”, and I’m not assuming it.

    The removal of the scores from the API seems [for me] heavy-handed and feels [for me] like the devs are forcing their preferences/values on others.

    To me it looks like a sensible decision that takes into account its impact on users and the Lemmyverse.

    EDIT: I’ll go further. Dunno if the devs agree with this or not, but I believe that “user aggregate score” = karma also attracts and retains users with the wrong mindset - who are not here to share, contribute or be part of something social and collective; but instead to farm virtual e-peen points for the sake of their individual egos. And I believe that this “it’s all about MEEE! ME! ME!” mindset is part of what makes Reddit such a dumpster fire.


  • I’m a nobody, but I’m officially supporting this decision of the devs to remove karma (user score aggregates) from the API. Because karma brings on a plethora of problems¹:

    • It is gamification of the system. As hinted by their PR, this is not healthy.
    • It leads to less varied and less interesting content, due to the fluff principle.
    • It feeds echo chambers, by giving people yet another reason to not confront them, even when moral and sensible to do so.
    • It shifts the focus from the content to the people, detracting from the experience of what boils down to a bunch of forums.
    • It is yet another reason for people to congregate in oversized and unruly communities, instead of splitting into smaller ones.

    Re-enable it at the API level and continue hiding it in Lemmy-UI if that is your personal stance on the matter.

    A lot of those issues will negatively affect your user experience, regardless of whether you use the karma feature or not. Simply because other people use it.

    And it’s also the sort of "lead acetate"² feature that makes clueless users annoy the shit out of interface developers, until they add it. “I dun unrurrstand, y u not enable karma? Y u’re app defective lol l mao” style. With app devs eventually caving in.

    As such, “leave it optional” is probably a bad approach.

    Considering how easy it is to spin up troll accounts or amass multiple troll accounts across multiple instances, removing a useful metric for identifying them at a glance is, IMO, irresponsible.

    This is a poor argument. It has some merit in Reddit³, but not in Lemmy.

    You aren’t identifying trolls by karma. You’re assuming that someone is a troll, based on a bad correlation. Plenty of users get low karma for unrelated reasons (false positive - e.g. newbie user unknowingly violating some “unspoken rule” of the local echo chamber), and plenty of trolls get past your arbitrary karma wall⁴ (false negative).

    So relying on karma to decide who’s a troll is not as effective as it looks, and it’s especially unfair to newcomers, thus discouraging the renewal of the community. IMO it’s a damn shitty moderator practice.

    Trolling is mostly an issue when you get the same obnoxious troll[s] coming back over and over and over, under new accounts, to post gaping anuses again, and mods have no way to detect if the troll came back. So mods should be upstreaming this issue to the admins of the instance of their comm, because the admins likely have access to users’ IPs⁵, and can prevent the user from creating a new trolling account every 15 days.

    And, if for some reason the admins are uncaring or uncooperative, the mods should be migrating the comm to another instance.

    What Lemmy needs is not to enable shitty moderation practices. It needs better mod tools to enable good moderation practices:

    • the context of the content being reported should be immediately obvious, no clicks needed
    • there should be a quick way to check all submissions/comments of a user to your community
    • there should be a way to keep notes about users, and share them with the rest of the mod team
    • some automod functionality. Such as automatically reporting (not removing!) content or replying to the user based on a few criteria defined by the mods.

    e.g. #2: If someone posts a particularly toxic comment but their score is high, I’m more likely to read through their history and conclude they’re having a bad day or something. Without the score, I will not read through and likely just ban them and move on.

    IMO this is also a shitty moderation practice. Should I go further on that? [Serious/non-rhetorical question.]

    NOTES:
    1. Since this is already a huge wall of text I didn’t go deep on each of those claims, but I can do so if desired/requested.
    2. It’s sweet but poisonous.
    3. Because in Reddit you can’t “migrate your sub to another Reddit instance”, and the only instance there happens to be administered by arsehats who give no fucks about you or your sub. It’s a dirtier situation that warrants dirtier solutions.
    4. Anecdote exemplifying this claim: from 2020~22 I had multiple trolling accounts in Reddit, to shitpost in cooking subs (for some puzzling reason they’re cesspools). Guess how many times this sort of “you need more karma to post here” barrier locked me out? Zero. It’s simply too easy to comment some shitty one-liner in a big community (I used r/askreddit for that) and amass 500, sometimes 2k karma points in a single go.
    5. If instance admins do not have access to the IPs of the users engaging with their instances, regardless of where those users registered, that should be fixed.


  • Sorry for the wall of text.

    I honestly do not think that your judgment was accurate in this situation, and I think that you jumped the gun; the poster sounds genuinely clueless. However I’m fully aware that I don’t have full access to all the info necessary to conclude shite here.

    Large bans don’t decrease your workload, they increase it.

    Trolls and bad faith agents might wait for a short ban to expire, but they won’t wait for a large ban - they’ll evade it with an alt account and call it a day, and now you’re playing whack-a-mole with them. With a permaban at least you’re telling them to fuck off, even if they won’t listen.

    For more sensible users, the large ban is unfair, and conveys “we still want you here… but we’re too lazy to deal with you right now, so shoo”. Other users are not blind; they will notice that the mods overreact to rule infractions and they will avoid reporting things, except for petty reasons. Now you’re bound to fine-comb threads manually to enforce the rules because nobody is reporting shite.

    Either way, you’re doing more work than you would otherwise.

    A better approach here would be to contain content prone to trigger rule-breaking comments. Megathreads work like a charm for that; they allow you to fine-comb a single thread instead of the whole community. It also helps to bring up the content diversity of the community.

    Another thing. I do agree with you that automatically tying that chant to Antisemitism is itself Antisemitic; however you’re taking for granted that all users are on the same page when it comes to that, and both of us know that the media is spamming them with misinformation that conflates Israel with Jewish people. In those situations it’s better to issue an official statement, explaining what will be considered Antisemitism for the sake of rule enforcement. (It helps to inform other users too.)


  • Let’s roll with your interpretation that the slogan is solely Antizionist. That would make the poster misinformed and incorrect; in this situation, the right thing to do is to talk with the poster, informing them, while checking their profile for potential Antisemitic activity. This also works great when the user is not rational (i.e. a bad faith agent) because it gives you better grounds for a ban.

    Another issue that I see is ban length. A short ban is great as a warning, or to tell the user to cool their head; while a permaban is great when you want to convey “we the mod team do not want you here, fuck off”. A two-month ban is the worst of both worlds.