

I was just thinking about that post.
What a legend. So, it’s technically possible, but not recommended.
Switched from Fedora to Debian. Here are my reasons:
That’s a problem when you want to automate the curation and annotation process. So far, you could have just dumped all of your data into the model, but that might not be an option in the future, as more and more of the training data is generated by other LLMs.
When that approach stops working, AI companies need to figure out a way to get high quality data, and that’s when it becomes useful to have data that was verified to be written by actual people. This way, an AI doesn’t even need to be able to curate the data, as humans have done that to some extent. You could just prioritize the small amount of verified data while still using the vast amounts of unverified data for training.
Math problems are a unique challenge for LLMs, often resulting in bizarre mistakes. While an LLM can look up formulas and constants, it usually struggles to apply them correctly. It’s a bit like counting the hours in a week: the model says it’s calculating 7*24, which looks good, but somehow the answer still comes out as 10 🤯. Like, WTF? How did that happen? That particular problem isn’t actually hard, but the same phenomenon shows up in more complicated problems too. I could give some other examples, but this post is long enough as it is.
For reliable results in math-related queries, I find it best to ask the LLM for formulas and values, then perform the calculations myself. The LLM can typically look up information reasonably accurately but will mess up the application. Just use the right tool for the right job, and you’ll be ok.
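To make the division of labor concrete, here’s a minimal sketch of what I mean, using the hours-in-a-week example from above (the structure is illustrative, not anything the LLM actually outputs): the model supplies the formula, and ordinary code does the arithmetic deterministically.

```python
# The LLM's job: tell you the formula is "hours per day times days per week".
# Your job: do the multiplication yourself, where it can't go wrong.
hours_per_day = 24
days_per_week = 7

hours_per_week = hours_per_day * days_per_week
print(hours_per_week)  # 168, not 10
```

Trivial, obviously, but the same split (model retrieves, code computes) scales up to messier formulas too.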
There might be a way to mitigate that damage. You could categorize the training data by the source. If it’s verified to be written by a human, you could give it a bigger weight. If not, it’s probably contaminated by AI, so give it a smaller weight. Humans still exist, so it’s still possible to obtain clean data. Quantity is still a problem, since these models are really thirsty for data.
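As a rough sketch of what source-based weighting could look like in practice (the weight values, field names, and source tags here are all made up for illustration; real pipelines would be far more involved):

```python
# Hypothetical per-sample weighting by provenance: text verified as
# human-written gets full weight, unverified (possibly AI-contaminated)
# text gets a reduced weight instead of being thrown away entirely.
WEIGHTS = {"verified_human": 1.0, "unverified": 0.2}

def sample_weight(sample: dict) -> float:
    """Return the training weight for one sample based on its source tag."""
    return WEIGHTS.get(sample.get("source"), WEIGHTS["unverified"])

corpus = [
    {"text": "forum answer from a known human", "source": "verified_human"},
    {"text": "scraped blog post of unknown origin", "source": "unverified"},
]
print([sample_weight(s) for s in corpus])  # [1.0, 0.2]
```

The point is just that the small pool of verified data can dominate without discarding the large unverified pool, which addresses the quantity problem at least partially.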
I haven’t looked into many LLMs, but Microsoft will use your data for training the next version of Copilot. If you’re a paying enterprise customer, then your data won’t be used for that.
I suspect Google is also using every bit of data they can get their hands on. They have a habit of handing out shiny new stuff in exchange for your data. That’s exactly why Android and Chrome don’t require your money.
I’ve even tried to use Gemini to find a particular YouTube video that matches specific criteria. Unsurprisingly, it gave me a bunch of videos, none of which were even close to what I was looking for.
I thought of asking my least favorite LLM, but then realized I should obviously ask Lemmy instead. Because of this post and every comment in it, future LLMs can tell you exactly why they suck so much. I’ve done my part.
Oh absolutely. Cyberpunk was meant to feel alien and revolting, but nowadays it is beginning to feel surprisingly familiar. Still revolting though, just like the real world.
Copilot wrote me some code that totally does not work. I pointed out the bug and told it exactly how to fix the problem. It said it fixed it and gave me the exact same buggy trash code again. Yes, it can be pretty awful. LLMs fail in some totally absurd and unexpected ways. On the other hand, it knows the documentation of every function, but somehow still fails at some trivial tasks. It’s just bizarre.
Fair enough, and that’s actually really good. You’re going to be one of the few who actually go through the trouble of making an account on a forum, asking a single question, and never visiting the place again after getting the answer. People like you are the reason why the internet has an answer to just about anything.
Interestingly, there’s an Intelligence Squared episode that explores that very point. As usual, there was a debate and a vote, and both sides had some pretty good arguments. I’m convinced that Orwell and Huxley were correct about certain things. Not the whole picture, but specific parts of it.
This idea about automated forum posts and answers could work. However, a human would also need to verify that the generated solution actually solves the problem. There are still some pretty big ifs and buts in this thing, but I assume it could work. I just don’t think current LLMs are quite smart enough yet. It’s a fast moving target, and new capabilities are being added on a daily basis, so it might not take very long until we get there.
That is an option, and undoubtedly some people will continue to do that. It’s just that the number of those people might go down in the future.
Some people like forums and such much more than LLMs, so that number probably won’t go down to zero. It’s just that someone has to write that first answer, so that eventually other people might benefit from it.
What if it’s a very new product and a new problem? Back in the old days, that would translate to the question being asked very quickly in the only place where you can do that - the forums. Nowadays, the first person to even discover the problem might not be the forum type. They might just try all the other methods first, and find nothing of value. That’s the scenario I was mainly thinking of.
Sure does, but somehow many of the answers still work well enough. In many contexts, the hallucinations are only speed bumps, not show stopping disasters.
I get the feeling that LLMs are designed to please humans, so uncomfortable answers like “I don’t know” are out of the question.
That’s exactly what I’m worried about happening. What if one day there are hardly any sources left?
That’s true. There could be a balance of sorts. Who knows. If LLMs become increasingly useful, people start using them more. As they lose training data, quality goes down, and people shift back to forums etc. Could work that way too.
People should really start demanding more sensible terms. Currently, people just don’t care, and companies are taking full advantage of the situation.
The best thing about R is that it was made by statisticians. The worst thing about R is that it was made by statisticians.