
A simple post shows that a 5 trillion dollar scientific research project, one that sustains the current market valuation separating the USA from bankruptcy, can be defeated with a simple prompt manipulation.

"I hacked ChatGPT and Google's AI - and it only took 20 minutes" - https://www.bbc.com/future/article/20260218-i-hacked-chatgpt...

You must like burgers...




Anyone can drive an F-22 into a ditch. Doesn't mean that it can't also be used to drop a 2k lb. bomb down your chimney from 40,000 ft.

That demonstration is interesting, but not really something new. Fooling very intelligent people into believing something completely absurd is incredibly easy. How many scientific papers have been retracted based on wholesale fabrications that fooled an entire review committee?

The question isn't "What is the dumbest thing I can do with this technology?" It's "What is the most valuable thing I can do with this technology?"


The technology is so dumb that it can easily be made to believe there is a Google mushroom. We are a long way from driving an F-22 into a ditch... although I am sure that, with the same techniques, we could make the AI make the F-22 bomb the Google headquarters...

A table saw is technology so dumb it can be made to chop off your fingers.

An air conditioner is technology so dumb that it can be used to kill an infant with hypothermia.

A human is a sentient being so dumb that it can be made to believe in things far more outlandish than a “Google mushroom”.

I can keep going. The point is that just about anything useful can do something dangerous or stupid. Most people can see that. Most people are more interested in how useful something can be, not how useless it is when intentionally misused.


Why is it a bad comparison to set someone intentionally crashing their own F-22 against the above example of intentionally trying to get bad results from their cheapest AI?


