@MagicShel

MagicShel@programming.dev · 1 month ago

I think job postings are better in indeed, but tbh >75% I’ve gotten in pretty much my whole 25+ year career has been through a recruiter. Dice.com used to be big for tech jobs back in the day but I’m not sure any more.

As a SSE, mostly I have recruiters hitting me up through linked in. This is also a really bad time. I’ve been back to work for about a month after 5 months of not finding anything. That’s the worst drought I’ve had in almost 15 years. Usually it’s < 1 month.

Be seriously prepared about cloud. It’s so anyone fucking wants right now. I’m a damn good Java/js developer, but I’m still learning the tech stack and I haven’t touched a line of code yet in this job. Everything has been configuration and pipelines. I feel more like devops than developer.

MagicShel@programming.dev · 1 month ago

I never buy gas at BP or Exxon. Ever. Smart phone though? That’s required for my job.

MagicShel@programming.dev · 1 month ago

Who would do such a thing??

MagicShel@programming.dev · edit-2 1 month ago

I don’t think I would agree that just because something is public that it’s a public forum. I feel like the public has to own it as well. I looked it up and maybe it’s because I predate social media by rather a lot, but I think of it in the classical sense:

Public forums are typically categorized into three types:

Traditional Public Forums: Long-established spaces like parks or sidewalks, where people have historically exercised their rights to free speech and assembly.

Designated Public Forums: Areas that the government intentionally opens up for public expression, such as town halls or school meeting rooms.

Limited Public Forums: Spaces opened for specific types of discussions or activities but with certain restrictions on the subject matter or participants.

The important factor being public ownership of the forum. I will concede that it has colloquially come to include public social media, but I think it’s important to distinguish that it’s not really the same thing at all as has been discussed through most of our history.

Food for thought. I just think calling them public forums attaches too much importance to a profit seeking endeavor.

MagicShel@programming.dev · 1 month ago

I would only note that for the vast majority of my experience these streams can only return up to a single match. Determinism isn’t really preserved by findFirst, either, unless the sort order is set up that way.

Finding the first Jim Jones in a table is no more reliable that finding any Jim Jones. But finding PersonId 13579 is deterministic whether you findFirst or findAny.

Perhaps you work in a different domain where your experience is different.

MagicShel@programming.dev · edit-2 1 month ago

I try to prefer .findAny() over .findFirst() because it will perform better in some cases (it will have to resolve whether there are other matches and which one is actually first before it can terminate - more relevant for parallel streams I think. findAny short circuits that) but otherwise I like the first. I’d probably go with some sort of composed predicate for the second, to be able to easily add new criteria. But I could be over engineering.

I mostly just posted because I think not enough people are aware of the reasons to use findAny as a default unless findFirst is needed.

MagicShel@programming.dev · edit-2 2 months ago

I think the process of explaining what you want to an AI can often be helpful. Especially given the number of times I’ve explained things to junior developers and they’ve said they understood completely, but then when I see what they wrote they clearly didn’t.

Explaining to an AI is a pretty good test of how well the stories and comments are written.

MagicShel@programming.dev · 2 months ago

Something something sharks?

MagicShel@programming.dev · 2 months ago

I appreciate you pointing this out. Today is literally my first day on the job after 5 months, but I’ll throw some cash their way after a check or two.

MagicShel@programming.dev · 2 months ago

Idk. Been doing it for nearly 20 years and before that I was doing IBM’s take on VBScript for another 10. So I have my own perspective there. I’ve only ever had to parse massive xmls when doing web apps, and for web backends I really only like Java and NodeJS.

But everyone is entitled to their own take. I would imagine there is a streaming parser in other languages as well.

MagicShel@programming.dev · 2 months ago

Maybe look into StAX?

MagicShel@programming.dev · edit-2 2 months ago

You made a lot of points here. Many I agree with, some I don’t, but I specifically want to address this because it seems to be such a common misconception.

It does and it doesn’t discard the original. It isn’t impossible to recreate the original (since all the data it gobbled up gets stored somewhere in some shape or form and can be truthfully recreated, at least judging by a few comments bellow and news reports). So AI can and does recreate (duplicate or distribute, perhaps) copyrighted works.

AI stores original works like a dictionary does. All the words are there, but the order and meaning is completely gone. An original work is possible to recreate by randomly selecting words from the dictionary, but it’s unlikely.

The thing that makes AI useful is that it understands the patterns words are typically used in. It orders words in the right way far more often than random chance. It knows “It was the best of” has a lot of likely options for the next word, but if it selects “times” as the next word, it’s far more likely to continue with, “it was the worst of times.” Because that sequence of words is so ubiquitous due to references to the classic story. But over the course of following these word patterns, it will quickly glom onto a different pattern and create a wholly new work from the original “prompt.”

There are only two cases in which an original work should be duplicated: either the training data is far too small and the model is overtrained on that particular work, or the work is the most derivative text imaginable lacking any flair or originality.

Adding more training data makes it less likely to recreate any original works.

I am aware of examples where it was claimed an LLM reproduced entirely code functions including original comments. That is either a case of overtraining, or far too many people were already copying that code verbatim into their own, thus making that work very over represented in the training data (same thing, but it was infringing developers who poisoned the data, not researchers using bad training data).

Bottom line: when created with enough data, no original works are stored in any way that allows faithful reproduction other than by chance so random that it’s similar to rolling dice over a dictionary.

None of this means AI can do no wrong, I just don’t find the copyright claim compelling.

MagicShel@programming.dev · 2 months ago

I had an A500 and the 40MB drive was as expensive as the computer.

MagicShel@programming.dev · 2 months ago

I can definitely account for 1.

MagicShel@programming.dev · 2 months ago

Sometime between 2013 and 2018. Had to answer it myself. It got at least a couple dozen upvotes and a lot of people finding it useful and asking follow up questions.

It’s deleted now. To be fair it was probably really outdated. But my account seems to be completely gone now. Maybe it got hacked. I haven’t been there in a long time.

MagicShel@programming.dev · edit-2 3 months ago

Also, this copy reads like it was written by AI. If this is indicative of the stuff on the website, I very much would not like to read more. If it was written by a human, they should definitely lay off the LSD before writing.

it is obvious that this powerful synergy will change our societal norms and potentialities.

Fuck right off.

MagicShel@programming.dev · 3 months ago

I mostly get what you’re saying, though I don’t have the requisite understanding to follow formal proofs, but if there is one thing I do know for certain, it’s that “understanding” is anthropomorphizing and shorthand for something that is very much not understanding in a human context at all.

I get that it can be hard to find the right words to explain a some of these emergent phenomena, but I think it’s misleading to use words that make AI appear to have a thought process akin to anything we could understand as such—at least in settings where folks might not understand the shorthand as such.

And maybe everyone here is aware of that, but it makes me uneasy, hence this comment to hopefully make that point.

MagicShel@programming.dev · 3 months ago

Okay. Well I’m not that worried until I see where things are headed. I can see a lot of ways for things to go badly, but no point in borrowing trouble over it.

MagicShel@programming.dev · edit-2 3 months ago

Yes and this was my reasoning for saying it would be fine to federate. But I’ll point out that federating ads would mean using my server’s infrastructure to serve ads on behalf of someone else. That would cost the admin more money and would require more user donations to keep it going. So just being able to block isn’t the necessary solution. Not sure that was even your point but I wanted to bring it up.

MagicShel@programming.dev · 3 months ago

My instance is defederated from threads. At the time I mildly disagreed with that decision. Federated ads would vindicate that decision. I don’t need threads content that badly.