Oops, something went wrong!

perishthethought@lemm.ee · edit-2 10 months ago

Oops, something went wrong!

MercuryGenisus@lemmy.world · 10 months ago

As a developer of many years I hate to tell you sometimes that it’s all the information we have when something breaks also. Most code is a god awful mess. Thankfully I love a good mystery.

AngryCommieKender@lemmy.world · 10 months ago

Please apply to Crowd Control Productions. You’ll never find a better mystery than current live code that has spaghetti that traces back to 2002 or 2001. The game went live in 2003. There’s one, kinda, server. Technically there are three, but most EvE players only have access to Tranquility, most don’t have access to Singularity, and apparently they have renamed the dev server from Multiplicity to Serinity. Some of us still have access to Serinity because we were able to play test Multiplicity, back in the day.

NιƙƙιDιɱҽʂ@lemmy.world · 10 months ago

I hate the process of debugging, but good god, once you’ve spent 5 hours tracking down the dumbest shit and gotten it to work, it’s better than an orgasm.

ILikeAllAss@lemmynsfw.com · 10 months ago

I agree that it’s (weirdly) uncommon to be the one saying “please give me more info about the error!”

A simple error code can be endlessly helpful (bonus points if there’s a corresponding support article explaining common codes)

Even if some codes are only useful to internal support, it’s handy to be able to search an error code and see “oh I can just jump straight to submitting a ticket/calling their support” or “oh, this fix might work”

Console_Modder@sh.itjust.works · 10 months ago

This is the only error message I need

https://youtu.be/fFtc8zeI6F8

Kissaki@feddit.org · 10 months ago

Ok, I’ll replace my error pages with a redirect to that

jordanlund@lemmy.world · 10 months ago

Ringtone for when work calls/texts:

https://youtu.be/t3otBjVZzT0

Monkey With A Shell@lemmy.socdojo.com · 10 months ago

Most common place I see those is when the site doesn’t want to talk to a known VPN endpoint. Like another mentioned site owners don’t want to given any more info to what they perceive as a possible threat than needed, so they just give a generic failure page.

obrenden@lemmy.world · 10 months ago

Reminds me of the old “Oopsie woopsie we made a fucky wucky!!” post

https://knowyourmeme.com/memes/oopsie-woopsie

Kissaki@feddit.org · 10 months ago

hperrin@lemmy.ca · 10 months ago

What are you planning to do with information about the error? It’s not like these places have customer support. Usually it’s something like a caching layer failing, and there’s literally nothing you can do about that.

Nouveau_Burnswick@lemmy.world · 10 months ago

If it’s an error code I’ve worked around before, apply same troubleshooting.

If its a new errror code, search the error code to see how other people solved it.

If no one else has solved the error code, try analogous troubleshooting, post results online with the error code name, successful or not.

hperrin@lemmy.ca · 10 months ago

These kinds of error messages are almost exclusively used for transient errors. You aren’t going to work around a transient error. The best thing you can do (the only thing you can do, really) is to try again later, hence, the message. It’s not helpful to show you a message like “cache-1234.example.com failed to respond within 300 milliseconds”. What are you going to do about that? By the time you submit a support ticket, that host has already been brought back up automatically. So now you’ve just wasted your time and the support staff’s time. The engineers already have a log of that error and a log of whatever error brought down that host, so you’re not telling them anything new by making a support ticket.

perishthethought@lemm.ee · 10 months ago

I agree with Nouveau_Brunswick here.

And to add: to @[email protected] , are you not also a user of software and do you not see room for improvement in many apps? That’s where I am rn: I just want them to try harder to communicate a tiny bit more info when things go so wrong that a message has to be displayed on my screen. Telling me “There’s nothing you can do to fix the problem” would be a big help, for instance. Make sense?

hperrin@lemmy.ca · 10 months ago

I am a developer of software. I can guarantee you that what you’re asking for would make my job harder, because I’ve done it, and it has made my job harder. If an error is transient (like, a caching layer error, a db connection error, an external API error, an endpoint connectivity error, etc), giving the user an error code will make it more likely that they’ll file a useless bug report or support ticket. The errors are all logged internally, and we can see when there is a spike in the error count. There’s no reason to give the user an error code, because there’s nothing helpful that the user can do with it, and there’s a lot of unhelpful things a user can do with it.

There are times where a message to the user is appropriate, like if they made a mistake with their input. But there are so many things that could go wrong that the user can’t do anything about. You’re not going to work around your DB shard going down, and a replica will replace it in a few seconds anyway, so giving you an error code does more harm than good. Telling you to try again later is exactly what I would tell you if you filed a support ticket. I don’t want to deal with useless support tickets, and you don’t want to deal with useless error messages.

Modern software stacks are big, complex systems with lots of failure points. We monitor them, and we can tell when you see these errors. If we chose to not show you a specific error code/message, there’s almost definitely a good reason.

Cryophilia@lemmy.world · 10 months ago

So what you’re saying is that your code is garbage and you’re hiding it from users because it’s too much work to fix it.

hperrin@lemmy.ca · edit-2 10 months ago

What I’m saying is that error messages can be helpful or harmful. Knowing that and how to tell the difference is what makes you an expert. Just firing off any information to the user without thinking about it is what makes you a novice, and will eventually get you fired. We’re talking about systems with millions of daily users. If you cause 2,000 unnecessary support tickets or forum posts every day because you don’t know when to send what information to the user, you won’t get very far in tech.

Cryophilia@lemmy.world · 10 months ago

If you have 2000 daily people getting error messages, your code is garbage rofl

And if your company would rather you avoid those tickets by not giving out error codes, your company is also garbage. Which to be fair, is a lot of tech companies.

hperrin@lemmy.ca · edit-2 10 months ago

I feel like you really don’t understand how big tech works. There’s not some single server running every service perfectly. There are tons of different layers and services running on thousands or hundreds of thousands of hosts.

Let’s say you make a request to something like Facebook. Say you’re liking a post. Here’s what happens:

That request goes in through a PoP (point of presence). These are sometimes called edge servers or edge gateways, but at Facebook we called them PoPs. This is a server that’s physically close to you that’s used to terminate the TLS connection. It doesn’t have any user data. Its job is to take your encrypted request, decrypt it, then pass it on to Facebook’s regional data center on their internal network.

The request enters a webby. These are usually called frontend servers, but again, at Facebook we called them webbies. This is a server that runs the monolithic Facebook web app. Again, it doesn’t have any user data. Its job is to take your request and orchestrate actions on deeper services to fulfill that request.

First it’s going to check a local memory cache server for sitevars. These control system level switches, like AB tests, and whether certain services are brought down. That server returns the sitevars and the webby proceeds, now knowing which logic paths to take.

For a like, which is a write request between your user account and a post, it will create two DB entries (you likes post, post liked by you). It needs to first get the data from the caching layer, so it will make two requests to TOA, one for your account, and one for the post.

TOA runs in the same regional data center, and if it doesn’t have the two data objects cached, it will request them from the regional db shards.

These regional db shards also run in the same data center, and they’ll return the data.

TOA returns the data back to the webby.

The webby (after doing some permission checks, which probably hit TOA again) now creates the two relationships, likes and liked by, referencing the two data objects, you and the post. TOA is a write-through cache, so the webby sends the writes to TOA.

TOA now needs to send the requests to the db primary shards, since they are the only ones that can handle writes. Your primary shard and the post’s primary shard are probably in different data centers, so TOA now passes the writes to the regional data centers for each primary shard.

A host running TOA in each regional data center for each primary shard now passes the write to each shard.

Each primary shard now writes the data to the local disk, and waits for the binary log to be written to the local journal before returning a success message.

The success message is passed from the local TOA host back to the original region’s TOA host.

When that TOA host gets both requests back successfully, it returns a success back to the webby handling your request.

The webby then returns a success to the PoP you’re still connected to.

The PoP then returns a success to the client running on your device.

The client doesn’t notify you of anything, because it already showed you a filled in like button right after you pressed it.

This was how it worked back in 2013 when I worked there. It probably hasn’t changed a whole lot, but this is also an extremely simplified overview. That request will probably hit hundreds of services. Some of them can fail and the request could still succeed. But some are required to succeed for your request to be considered successful, like the db write operations. Something like a hardware failure on your primary db shard’s disk can’t be overcome with better code. Nor can a lightning strike taking out the cable connecting your PoP be overcome with better code.

These systems are absolutely massive, and there are failures you wouldn’t even think of. When I worked at FB, we had an entire data center go down because the humidity got just high enough that the capacitors in each hosts’ power supplies all failed in a matter of a few minutes. Thousands of users probably got error messages that day, but the automatic failover systems moved all the traffic to a new region and promoted new primary db shards within about ten minutes. The fact that losing an entire data center was mitigated in about ten minutes is actually really impressive. You might think it’s still garbage code, since users got error messages, but I know enough about these systems to be very impressed by that.

If you know a better way to make a system like this that works for billions of users across the planet, you should write a paper and submit it to a local conference. If they approve you for a talk, you can present your designs to an audience there. If the audience is really receptive, your designs could make a big impact in the tech sector. That’s basically what the highest level engineers at these big tech companies do when they design these multi-billion user systems, so it’s definitely possible for you to do it too.

unhrpetby@sh.itjust.works · 10 months ago

By nature of software consisting of a client and a server, there are certainly errors that can be bypassed on the client side.

Server side software does not mean “there is literally no errors that are dependent on client input.” That’s ridiculous to think, but pervasive in this comment section it seems.

hperrin@lemmy.ca · 10 months ago

I don’t know why you think what I said means that. These error messages are never used on data validation issues. At least, I’ve never seen a data validation issue return an error like this, and I would never write an error like this for a data validation issue.

These messages come from 500-series errors. Usually caching layer errors, load balancer layer errors, edge termination layer errors, or db layer errors. In other words, there was probably nothing wrong with the request, it just couldn’t be fulfilled successfully, hence the “try again later” part in a lot of these messages.

unhrpetby@sh.itjust.works · edit-2 10 months ago

These error messages are never (sic) used on data validation issues.

You are incorrect. I have had issues that were exactly that. Such as a password that was failing to be accepted and then giving generic error responses, which I then had to trial-and-error brute force to find which part of my password they weren’t allowing on the backend.

You stance might become easier to defend if you avoid absolutes.

hperrin@lemmy.ca · edit-2 10 months ago

Read the next sentence.

It sounds like your problem is not with these errors in general, but with specific software that uses generic messages when not appropriate.

unhrpetby@sh.itjust.works · edit-2 10 months ago

The error is unnecessarily vague.

If the message is supposed to mean “There is an internal error that is of little use to you, so you can only wait while we fix it. Try again in 10 minutes.” Then say that. That tells me a developer made a conscious decision to classify the failure mode as one which I cannot fix. They are explaining to you what type of error they perceive it to be.

Instead we have “Something went wrong. Try again later.” which doesn’t say that directly. This could just be them designing their systems as though every user is incompetent, and denying you the information to fix the issue yourself.

You wouldn’t know, because it doesn’t just tell you directly.

hperrin@lemmy.ca · 10 months ago

It is intentionally and, I would argue, necessarily vague.

First, there is no time frame for these kinds of errors. If it’s just a cache host that’s down, you could retry right now and the load balancer would probably have taken that host out of rotation already. If it’s a primary db that’s down, that may take 5 minutes. If there’s no replica to promote, it might take 30 minutes. If the whole db layer is down, it might take an hour or two. If an entire release needs to be rolled back, it might take a couple hours. There are just too many scenarios and too many variables to give a useful time frame.

Second, you might appreciate an error message like that, but these error messages aren’t written for you and they’re usually not even written by developers. They’re written by designers and translated into many languages. They need to be concise, easily understood, and not easily construed as derogatory or malicious in any language. They are written for the broadest audience. You are not the broadest audience.

Third, we have to design systems as if every user is incompetent and/or malicious, because many of them are. Let me give you an example. I once got an email from another engineer using an internal system my team wrote. He said, “hey I’m getting this error, can you help?” He attached a screenshot showing an error message that read, “Your auth token has expired. Please refresh the page.” He was a senior engineer.

Fourth, and I cannot stress this enough, there is almost always nothing you can do when you hit an error like this. Any information given to you for the vast majority of these kinds of errors would be entirely useless to you. You cannot promote a db shard yourself. You cannot bring up a cache host yourself. You cannot take a host out of load balancer rotation yourself. The only reason this information could possibly benefit you is to satisfy your curiosity.

unhrpetby@sh.itjust.works · edit-2 10 months ago

There is no time frame for these kinds of errors

If I was are able to isolate the issue to, for example, expired certs, I could absolutely give you a ballpark answer on how long it should take/when it might be back up. It doesn’t need to be very precise, but I have accessed websites only to be shown an error with zero idea whether this is a multi-day event or something I can wait five minutes and it be fixed.

…they are written by designers…

Cooperation with a developer would help here.

They are written for the broadest audience

If you write only for a child, your usefulness ceiling is that of what a child could understand. You could have your obvious boilerplate message, and then under that provide more information.

…not easily construed as derogatory or malicious in any language.

I feel as if this is a simple problem to avoid.

We have to design systems as if every user is incompetent…

See the bottom of this post

there is almost nothing you can do when you hit an error like this.

If the company believes so, then write that part in. Otherwise, it isn’t stated that such is the case. It would be one more sentence on the boilerplate section.

Overall this has to do with what you are optimizing for. Its clear to me that many businesses believe useless boilerplate error messages are most cost effective. If you want to be most cost-effective, then cutting corners on the error messages likely saves time with few financial downsides. But It doesn’t have to be this way.

Designing systems for the lowest person on the totem poll isn’t without downsides. I have used Linux systems that made the bootup hide all log messages. This means that people that can actually fix a broken system using the logs, are going to have a harder time, as you just hid away all the moving parts and complexity from the end user. Some machines I wouldn’t have been able to fix were it not for the detailed logs.

Or we could talk about privacy. Nearly everyone can use a computer. Great right!? But how many people actually understand the privacy implications of using a machine that is controlled by a closed source corporation. Of entering load of data into that machine? Very few.

You can design a system for idiots. But you don’t have to. There are things in life that have prerequisites. If someone comes over to my computer and asks “What’s that” on a kernel log output, I’ll ask them, “Do you know what a kernel is”. If they don’t, then I will tell them not to worry about it. My explanations are not for everyone. Neither are my software.

ImplyingImplications@lemmy.ca · 10 months ago

Error messages are a common way for hackers to gain information about a system. Useless error messages are recommended for security.

If you enter your username as Robert’'); DROP TABLE Students;-- giving the error “Oops, something went wrong” is better than “NoSuchTable: ‘Students’ Table doesn’t exist in the database” because now the hacker knows you’re using a database that interprets SQL commands and inputs aren’t being sanitized.

Hacking programs like Burp Suite have functions that spam sites with all kinds of garbage data and uses error messages and delays in response times to highlight potential vulnerabilities.

aesthelete@lemmy.world · edit-2 10 months ago

Yeah but most of these errors don’t even give out a uuid that could be used to relate the error to logs to be resolved by someone.

Not that that someone exists anyway. Let’s face it the entire industry is a massive joke and a pile of shit and with AI coming fast and hard soon you won’t even get the privilege of venting to a call center person about it.

You’ll vent to some made-up chatbot named veeblezorp and he will give you an impromptu therapy session about the state of the world. Your computer/tablet/phone/app still won’t work properly and veeblezorp will try to get you through the stages of grief about that.

Just unplug it and don’t plug it back in again. Go for a walk. Play with the dog. Hug your children. Stop buying crap online that scales up infinitely to take new customers (and their dollars) but is forever stuck at the sub-garage startup level when it comes to support.

Djehngo@lemmy.world · 10 months ago

Even if you don’t buy into this logic, you still have to do it in quite a few places because the security auditors have a line in their checklist about being able to extract any internal information from error pages

jbk@discuss.tchncs.de · 10 months ago

Security through obscurity is still wack tho

MonkderVierte@lemmy.ml · 10 months ago

So in short it’s used to mask the developers errors?

unhrpetby@sh.itjust.works · 10 months ago

Same logic as “keep it closed source for security”.

Possibly linux@lemmy.zip · 10 months ago

This comment belongs on masterhacker

NABDad@lemmy.world · 10 months ago

Users ignore error messages.

I have seen my users request support, proceed to demonstrate the issue they’re having, and click through error messages so fast there isn’t even enough time for me to say “WAIT!” Forget about being able to actually read even one word of the message before it’s dismissed from the screen.

They treat the error messages like they are just an annoying mosquito to be swatted away as quickly as possible. This despite the fact that the whole reason I’m standing behind them is so I can see what it’s going wrong and, you know, read the error messages.

Cryophilia@lemmy.world · 10 months ago

So the solution is to remove the error messages? That makes no sense.

NABDad@lemmy.world · 10 months ago

I wasn’t saying it was a solution, I was just offering a possible explanation. I understand why someone wouldn’t bother telling the user what’s happening. For my code, I log everything that happens, and tell the user to call me. When I get the call, I check the log and use that to figure out what went wrong.

Distressingly typical user communication:

User: There was an error message.

Me: What did the error say?

User: I don’t know. Something about the problem.

Cryophilia@lemmy.world · 10 months ago

If you remove error codes, all you’ve done is guarantee every interaction is like that.

NABDad@lemmy.world · 10 months ago

You’ve got it backwards. Once every interaction is like that, you stop showing error codes.

I have trouble with my colleagues in IT doing the same crap. They come to me to ask me to help them with a problem, and they don’t have the errors!

perishthethought@lemm.ee · edit-2 10 months ago

Them:

“What? Oh that? That always happens”

Me:

Grrrrr

Yeah, you’re right, but I still would prefer to see something telling me whether something I did caused the problem or something went wrong in the software / on the server. From this thread, I’m getting that my wishes will not be heard.

MonkderVierte@lemmy.ml · 10 months ago

That’s Windows’ training bearing fruit.

Possibly linux@lemmy.zip · 10 months ago

What I hate even more are error codes

Executive Chimp@discuss.tchncs.de · 10 months ago

Precise, easily searchable codes that describe the problem? What a ball ache!

MonkderVierte@lemmy.ml · 10 months ago

Not on Windows, no. “This error code could indicate this or that or something”.

perishthethought@lemm.ee · 10 months ago

This person gets it. Thank you.

Moonrise2473@feddit.it · 10 months ago

Exactly, it’s especially infuriating on newpipe. WHAT went wrong? It’s an error 500 from YouTube (rare, unfixable, try again) or Google changed something and need to wait for a client fix? Or simply Google blacklisted the IP address or put some captcha that prevents playing the video??

oni ᓚᘏᗢ@lemmy.world · 10 months ago

If it doesn’t tell you what error it is, isn’t an error message, it’s just a message that says that something went wrong. Even a calculator has proper error messages that show you actual information about what is wrong

cattywampas@lemm.ee · 10 months ago

Because 99% of the time these errors are caused by something on their end that the user is unable to fix, even on the off chance that they understand the problem in the first place. So there isn’t any need to give you more information than “something went wrong, please wait a minute and/or try again”.

bleistift2@sopuli.xyz · 10 months ago

I give my users instructions on how to report an error if they seek assistance. It’s regularly ignored. Instead we get the ubiquitous “Something bad happened … somewhere. HALP!”

Executive Chimp@discuss.tchncs.de · edit-2 10 months ago

“I got an error”

“What did it say?”

“I don’t know, just something went wrong”

“👍”

unhrpetby@sh.itjust.works · 10 months ago

Blanket “99%” statements are unfounded. I have had countless issues I was able to fix through error messages and some without.

Source your claim.

hperrin@lemmy.ca · 10 months ago

I can’t speak for the other user’s claim, but I’ve worked at Facebook, Google, and LinkedIn, and have written plenty of error messages. When I write a message like these, it’s specifically because the user can’t do anything about it. I’ll log the error to our internal error tracking systems with actual information about it, then give the user a generic message.

If it’s something the user did wrong, and they can fix it, I’ll absolutely give them a message saying that. Usually I won’t even let a user submit a bad request, but sometimes users will bypass frontend restrictions to submit it, so the server always needs to validate it again anyway. The fact that plenty of users won’t even read the message I write is kind of annoying, but at least the users who do read it will know how to fix it.

I’ve tried sending detailed error messages before, and that invariably results in users submitting support tickets and forum posts for things that aren’t helpful. You learn pretty quickly what kind of messages are helpful and what kind aren’t.

unhrpetby@sh.itjust.works · edit-2 10 months ago

I would appreciate the detailed error responses even if the developers don’t think it would be of use to them.

When a project has unexpected downtime, and they do a postmortem explaining exactly what part of their infrastructure failed, what steps they took to resolve it, and how they will prevent it in the future, that is great.

I appreciate transparency. Of course, to expect this from a large corporations is expecting a pig to fly, but detailed error messages are one more step away from “We are the cloud” and one step towards “We are real people providing a service which operates on server infrastructure consisting of…” Its transparent, down-to-earth, and respects people who do want to see behind the scenes.

One company I used even had a white paper explaining their infrastructure as a whole.

This all may not make you more money, but I prefer this to instead treating me with the bare minimum insight into the inner-workings.

hperrin@lemmy.ca · 10 months ago

I want you to really pay attention to the last paragraph of my previous comment. It’s the most important part. You might like having more information that doesn’t help you, but that comes at the cost of thousands of useless tasks and posts that all have to be manually closed. It’s not only not helpful to give the user detailed error messages in a lot of cases, it’s actively harmful to a business. It doesn’t make any business sense to tell a user that a cache layer host or a db shard is down. As a developer of these kinds of systems, I’m not going to give extra information that you don’t need just to make a few users happy that they get a peek under the hood if it means hurting our support staff.

unhrpetby@sh.itjust.works · edit-2 10 months ago

How is a user sending a support request containing the information: “Site not working. Error message: A surge in requests is overloading the server. Everyone is being ratelimited.” Any different from them just saying “Site not working.”?

If they were going to submit an issue for a problem that is already known, why would the error message significantly change the difficulty of dealing them?

perishthethought@lemm.ee · 10 months ago

OK but then inherent in what you’re saying is also the message, “… and don’t contact us about this, because we don’t want to deal with it” which is also mildly infuriating to me.

cattywampas@lemm.ee · 10 months ago

You’re assuming they aren’t already aware of the issue.

perishthethought@lemm.ee · 10 months ago

Sorry but how does that help me?

spongebue@lemmy.world · 10 months ago

How does telling someone about a problem they’re already aware of help you?

Forester@pawb.social · edit-2 10 months ago

When you’re in a restaurant and your server accidentally dumps the entire tray of food in your lap, would you prefer them to apologize or just stare at you because you already know what the problem is. Clearly you requested food to be delivered to your table and it didn’t make it there.

spongebue@lemmy.world · 10 months ago

An apology would be nice, but do you really need to explain to the server that you still haven’t received your food? I think they know that

cattywampas@lemm.ee · edit-2 10 months ago

What I’m saying is that when you see one of these messages you should interpret it as “something is wrong on our end, nothing you can or need to do on your end, please hang tight as we’re aware of the issue and working on it”. They don’t give you more info than that because that average person is probably not a dev and doesn’t have any need for more details than that.

DaGeek247@fedia.io · 10 months ago

You’re giving an incredibly large allowance for companies that have continuously calculated exactly how much they can fuck over their customers for more money before we decide to use some other product.

Rhynoplaz@lemmy.world · 10 months ago

But it’s MY Internet, and I want it NOW!!

hperrin@lemmy.ca · 10 months ago

The “we don’t want to deal with it” part is something you’re attributing to them with no evidence. As a former SRE, I can guarantee you they are dealing with it.

🇰 🌀 🇱 🇦 🇳 🇦 🇰 🇮 @pawb.social · 10 months ago

Of course if their servers and whatnot are shit they won’t straight up tell you they are shit.

It’s why modern multiplayer games don’t even show everyone’s latency anymore. It would let players know imperically that their servers are shit.

Thaurin@lemmy.world · edit-2 10 months ago

Iit’s an internal error that is not handled properly. They don’t want to tell you the exact error message and detailed information around that, because it would expose the internal state of the backend and that would be a security issue. There is really nothing more that they can tell you, except that a developer needs to look at this (and possibly thousands to tens or hundreds of thousands of similar logged errors) and they probably already are.

Cryophilia@lemmy.world · 10 months ago

It’s almost never an internal error. The vast majority of the time it’s vpn blocking or some such bullshit.

perishthethought@lemm.ee · 10 months ago

Maybe then, the message could be, “An internal error has occurred and we’re going to work on fixing it but there’s nothing you can do to fix it yourself right now”. It’s the “Oops” that fries my grits.

AwesomeLowlander@sh.itjust.works · 10 months ago

If you’re tech-savvy enough to want detailed error messages, you should also be tech-savvy enough to understand the implied message you just typed out. The ‘Oops’ isn’t for you, it’s for the average user.

cattywampas@lemm.ee · 10 months ago

I do agree, the whole “oops sowwy” with a sad Labrador vibe is a little irritating. But I guess they do it cause it’s a harmless and layman-friendly response.

Forester@pawb.social · edit-2 10 months ago

LMFAO. I probably have to truncate at least five error log files a week on various vps servers at my company because they fill the SSD and crash the OS. We rent servers we don’t dev them for our cx.

Largest error file I’ve seen so far is 32 GB

Site owners are normally clueless. Site developers normally can’t give a single fuck and systems administrators like me. Get to pick up the pieces and tell them to tell their Deb to fix it and then we pick it up again and tell them to tell their Dev to fix it let me know when you sense a pattern

Thaurin@lemmy.world · 10 months ago

Are you not rotating your logs with for example logrotate?

Forester@pawb.social · edit-2 10 months ago

To reiterate, they are not my logs. It’s not my server. It’s a server that the customer is renting and not maintaining and we’re not going to purge their data unless they ask not all logs rotate. Mostly error_log files in garbage wp sites

VeganCheesecake@lemmy.blahaj.zone · 10 months ago

In theory, maybe. In practice, I’ve had a lot of errors in that vein that very much wouldn’t go away, and where made much harder to diagnose by their obtuseness.

Honestly, I even dislike the mindset. Just make a big header with the generic error message and a little one below that gives some details. Having users interested in how your software works is not a bad thing.

scarabic@lemmy.world · edit-2 10 months ago

Look, the majority of the software is on the server end. Even if they gave you a full stack trace, and you understood it, you wouldn’t be able to do anything about it. You want them to give you a way to send in a merge request? I’m sure that would be received here as Google exploiting you for free.

unhrpetby@sh.itjust.works · 10 months ago

“Majority of software” source your claim.

If you use your computer as a bootloader for Google chrome, maybe. Local software and SASS both benefit from error messages because you cannot assume every error you can do nothing about.

An error about parsing my password and a stack trace? I can possibly deduce to limit the length, remove special characters, or add special characters and try again.

“Something went wrong.” is lazy and nontransparent.

Cryophilia@lemmy.world · 10 months ago

Just fucking tell me when my adblock is causing you to deny me access, or my connection is too weak/slow. I don’t need a fucking full stack trace.

scarabic@lemmy.world · 10 months ago

“Hey boss? I got a user here who’s blocking our ads and cussing us out.”

“Oh dear. Better give him whatever he wants.”

Cryophilia@lemmy.world · 10 months ago

Yeah, otherwise you won’t have a user because I’ll find a competitor that is actually workable on my system.

Celestus@lemm.ee · 10 months ago

Never show the customer a big scary error message. In the unlikely event something goes wrong, dispatch an error event so the engineers can track the issue. They’ll pull detailed logs, and know what to do. Meanwhile, guide the customer to return later with the error screen, and collect their userId if possible. Once the issue is fixed, send them an apology email to let them know they should try again

Most companies just follow the first half of that process, unfortunately

perishthethought@lemm.ee · 10 months ago

Yah, I love that idea of follow up but I imagine if the software is used by hundreds of thousands or more people, they can’t follow up on every case.You’d think that could be largely automated but who knows.