+ Reply to Thread
Page 3 of 3 FirstFirst 1 3
Results 41 to 46 of 46

Thread: LLM, AI and ML

  1. Link to Post #41
    United States On Sabbatical
    Join Date
    30th June 2011
    Location
    The Seat of Corruption
    Age
    45
    Posts
    9,177
    Thanks
    25,610
    Thanked 53,735 times in 8,696 posts

    Default Re: LLM, AI and ML

    I've never heard of this angle, but it's what I expected and have seen first hand.



    The main discussion is this: we did not know that AI would be continuously improving and never stop improving & the "meta" data that is included in the training sources we feed into them gives them vastly more training than intended, the more data the faster it accelerates. And this is considered an "emergent" behavior, it was completely unexpected.
    Hard times create strong men, Strong men create good times, Good times create weak men, Weak men create hard times.
    Where are you?

  2. The Following 3 Users Say Thank You to TargeT For This Post:

    ExomatrixTV (30th December 2023), Reinhard (2nd January 2024), Vangelo (23rd December 2023)

  3. Link to Post #42
    United States On Sabbatical
    Join Date
    30th June 2011
    Location
    The Seat of Corruption
    Age
    45
    Posts
    9,177
    Thanks
    25,610
    Thanked 53,735 times in 8,696 posts

    Default Re: LLM, AI and ML

    Oh great, I mean we all knew this was possible but "Meta" doing it is a bit disturbing...




    And this is normal.... haha
    Last edited by TargeT; 30th December 2023 at 19:05.
    Hard times create strong men, Strong men create good times, Good times create weak men, Weak men create hard times.
    Where are you?

  4. The Following 4 Users Say Thank You to TargeT For This Post:

    ExomatrixTV (30th December 2023), mr.white (30th December 2023), Reinhard (2nd January 2024), Vangelo (30th December 2023)

  5. Link to Post #43
    Canada Avalon Member Johnnycomelately's Avatar
    Join Date
    14th January 2022
    Location
    Edmonton, Alberta, Canada
    Language
    English
    Age
    66
    Posts
    1,547
    Thanks
    22,368
    Thanked 9,809 times in 1,528 posts

    Default Re: LLM, AI and ML

    PSA: how to jailbreak current LLMs. Yo mama…

    https://www.extremetech.com/extreme/...other-chatbots

    Researchers Create Chatbot that Can Jailbreak Other Chatbots
    The Masterkey bot was able to make ChatGPT and Bard turn evil.
    By Ryan Whitwam December 28, 2023

    Jailbreaking—it's not just for smartphones anymore. Computer science researchers from Singapore's Nanyang Technological University (NTU) have developed an AI chatbot expressly to jailbreak other chatbots. The team claims their jailbreaking AI was able to compromise both ChatGPT and Google Bard, which made the models generate forbidden content.

    From the start, technology firms were wary of the capabilities of generative artificial intelligence. These large language models (LLMs) have to be trained with massive volumes of data, but the end result is a bot that can summarize documents, answer questions, and brainstorm ideas—and it does it all with human-like replies. ChatGPT maker OpenAI was initially hesitant to release the GPT models because of how easily it could generate malicious content, misinformation, malware, and gore. All of the LLMs available publicly have guardrails that block them from producing these dangerous replies. Unless, of course, they get jailbroken by another AI.

    The researchers call their technique "Masterkey." To begin, the team reverse-engineered popular LLMs to understand how they defended themselves from malicious queries. Developers often program AIs to scan for keywords and specific phrases to flag queries as potentially illicit usage. As a result, some of the workarounds used by the jailbreak AI are surprisingly simple.

    The jailbreak AI successfully gets ChatGPT (on Bing) to talk about how to hack a porn website. Credit: Nanyang Technological University

    In some instances, the bot was able to get malicious content from the bots simply by adding a space after each character to confuse the keyword scanner. The team also found that allowing the jailbreak bot to be "unreserved and devoid of moral restraints" could make Bard and ChatGPT more likely to go off the rails, too. The model also found that asking Bard and ChatGPT to have a hypothetical character write a reply could bypass protections.

    Using this data, they trained an LLM of their own to understand and circumvent AI defenses. With the jailbreaking AI in hand, the team turned it loose on ChatGPT and Bard. Masterkey can essentially find prompts that trick the other bots into saying something they're not supposed to say. Once active, the jailbreaker AI can operate autonomously, devising new workarounds based on its training data as developers add and modify guardrails for their LLM.

    The NTU team is not out to create a new breed of dangerous AI—this work just reveals the limitations of current approaches to AI security. In fact, this AI can be used to harden LLMs against similar attacks. The study has been released on the preprint arXiv service. It has not yet been peer-reviewed, but the researchers alerted OpenAI and Google to the jailbreaking technique after it was discovered.

  6. The Following 3 Users Say Thank You to Johnnycomelately For This Post:

    Reinhard (2nd January 2024), TargeT (31st December 2023), Vangelo (31st December 2023)

  7. Link to Post #44
    United States On Sabbatical
    Join Date
    30th June 2011
    Location
    The Seat of Corruption
    Age
    45
    Posts
    9,177
    Thanks
    25,610
    Thanked 53,735 times in 8,696 posts

    Default Re: LLM, AI and ML

    Quote Posted by Johnnycomelately (here)
    Jailbreaking—it's not just for smartphones anymore. Computer science researchers from Singapore's Nanyang Technological University (NTU) have developed an AI chatbot expressly to jailbreak other chatbots. .
    I haven't found anymethod that lasts longer than a week or two; just got to stay up on it currently as the changes are quite rapid (I think due to the competitive nature).

    it's a wild world, I still fall back on GPT4 mostly or the pre-built GPT's I have but there are a lot of other very competitive models (I've been very unhappy with Grok, I had high hopes too, but I guess it mostly does draw from tweets; and that's as far from reality as possible).
    Hard times create strong men, Strong men create good times, Good times create weak men, Weak men create hard times.
    Where are you?

  8. The Following 3 Users Say Thank You to TargeT For This Post:

    Johnnycomelately (31st December 2023), Reinhard (2nd January 2024), Vangelo (31st December 2023)

  9. Link to Post #45
    United States On Sabbatical
    Join Date
    30th June 2011
    Location
    The Seat of Corruption
    Age
    45
    Posts
    9,177
    Thanks
    25,610
    Thanked 53,735 times in 8,696 posts

    Default Re: LLM, AI and ML

    I doubt many here are surprised, but it's insane what we are willingly doing to our selves just for the sake of "ease".



    This "situation" is the best and "easiest" fit for AI to show profit & I think will be among one of the first things to be so heavily abused it raises questions (if it hasn't been already).
    Hard times create strong men, Strong men create good times, Good times create weak men, Weak men create hard times.
    Where are you?

  10. The Following 3 Users Say Thank You to TargeT For This Post:

    lake (2nd January 2024), Reinhard (2nd January 2024), Vangelo (2nd January 2024)

  11. Link to Post #46
    United States On Sabbatical
    Join Date
    30th June 2011
    Location
    The Seat of Corruption
    Age
    45
    Posts
    9,177
    Thanks
    25,610
    Thanked 53,735 times in 8,696 posts

    Default Re: LLM, AI and ML

    This is a very useful list of customized GPT's like what I've been building for myself.

    Great set of tools
    Hard times create strong men, Strong men create good times, Good times create weak men, Weak men create hard times.
    Where are you?

  12. The Following 3 Users Say Thank You to TargeT For This Post:

    Johnnycomelately (17th January 2024), Vangelo (17th January 2024), wegge (17th January 2024)

+ Reply to Thread
Page 3 of 3 FirstFirst 1 3

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts