Max-P

Max-P@lemmy.max-p.me · 3 days ago

This. They even provide the cover image to use. If they don’t want embedding they could just block the request.

But they don’t want to. They want to sell the cake and eat it too.

Max-P@lemmy.max-p.me · 3 days ago

Bluesky is its own thing with its own federated protocol called the AT Protocol (atproto). They have an explanation in their docs of how it works and what the different features are. There’s a bridge between the two as well, a bit janky but effective.

Max-P@lemmy.max-p.me · 4 days ago

You just put both in the server_name line and you’re good to go.
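Assuming this is about nginx, a minimal sketch of one server block answering for two hostnames. Both names below are placeholders, not taken from the thread:

```nginx
server {
    listen 80;
    # Placeholder hostnames: list every name this block should answer for,
    # separated by spaces, on the same server_name line.
    server_name example.com www.example.com;
}
```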

Max-P@lemmy.max-p.me · 8 days ago

I think a part of it is that English is just the default language and strongly leans American already, so there’s just no demand for a USA instance and people just use the popular or thematic ones for that content. There’s no legal advantage to preferring US hosting either.

The country ones make sense because they’re also a different language, like jlai.lu in French, and the feddits for European languages.

Max-P@lemmy.max-p.me · 24 days ago

The problem with a different spoof for each domain is that this behavior on its own can be used as a fingerprint based on timestamp and IP in access logs.

Hiding among the crowd is probably better, especially since newer versions of Chrome all report the same UA, so you blend in even more.
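To illustrate the fingerprinting concern, here is a rough sketch of how someone holding access logs pooled from several sites could spot it. The log layout `(ip, timestamp, user_agent)`, the function name, the 10 minute window and the sample data are all assumptions made up for this example:

```python
from collections import defaultdict
from datetime import datetime, timedelta


def distinct_uas_per_ip(entries, window=timedelta(minutes=10)):
    """Return {ip: largest number of distinct User-Agents seen within one window}."""
    by_ip = defaultdict(list)
    for ip, ts, ua in entries:
        by_ip[ip].append((ts, ua))

    result = {}
    for ip, hits in by_ip.items():
        hits.sort(key=lambda h: h[0])
        best = 0
        start = 0
        for end in range(len(hits)):
            # Slide the left edge forward until the span fits in the window.
            while hits[end][0] - hits[start][0] > window:
                start += 1
            best = max(best, len({ua for _, ua in hits[start:end + 1]}))
        result[ip] = best
    return result


# Hypothetical pooled log entries: one client spoofing per domain, one not.
t0 = datetime(2024, 1, 1, 12, 0, 0)
sample = [
    ("203.0.113.5", t0, "UA-A"),
    ("203.0.113.5", t0 + timedelta(seconds=30), "UA-B"),
    ("203.0.113.5", t0 + timedelta(seconds=90), "UA-C"),
    ("198.51.100.7", t0, "UA-A"),
    ("198.51.100.7", t0 + timedelta(minutes=5), "UA-A"),
]
counts = distinct_uas_per_ip(sample)
```

In this made-up sample the spoofing client shows three different UAs from one IP within two minutes, while the other client shows one. A single IP cycling through UAs that fast is unusual enough to stand out on its own.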

Max-P@lemmy.max-p.me · 25 days ago

You can block them and over time it should get better, or you can write a script that does some checks and blocks them for you.

Max-P@lemmy.max-p.me · 26 days ago

Also, series F but they’re only deploying on one server? Try scaling that to a real deployment (200+ servers) with millions of requests going through and see how well that goes.

And also no way their process passes ISO/SOC 2/PCI certifications. CI/CD isn’t just “make do things”, it’s also the process, the logs, all the checks done, mandatory peer reviews. You can’t just deploy without the audit logs of who pushed what when and who approved it.

Max-P@lemmy.max-p.me · 30 days ago

My point was really that data can’t be that expensive, even including transit fees from the likes of Cogent and Level3, because I can use TBs of bandwidth every month and OVH doesn’t even bother measuring it.

If my home ISP gives me a gigabit link, yes I pay for all the cabling and equipment to carry that traffic. But that’s it, I already pay for infrastructure capable of providing me with gigabit connectivity. So why is it that they also want me to pay per the GB?

In Europe they can provide gigabit connectivity for dirt cheap with no caps, and they don’t even bother with tiered speed plans there. So how come my $120+/mo Internet in the US isn’t sufficient to cover the bandwidth costs? It’s ridiculous; even Starlink doesn’t have data caps.

But somehow communities with crappy DSL that can barely do 10 Mbps still have ridiculously low data caps. It’s somehow not a problem for most ISPs in the world, except US ISPs, in supposedly the richest and most advanced country in the world.

Max-P@lemmy.max-p.me · 1 month ago

Yeah sure, then why is it that my entire bare metal server leased from OVH costs less than my Internet connection, and comes with fully unmetered access too?

I pay for a data rate and I should be able to use the full amount as I please. If we paid for the amount of data then why are we advertising speeds and paying for speeds?
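To put numbers on the rate versus volume argument, here is a quick back-of-the-envelope sketch. The function names and the 1 TB cap are hypothetical, picked only for illustration, and decimal units are assumed (1 TB = 1000 GB):

```python
def monthly_volume_tb(rate_gbps, days=30):
    """Data moved in `days` days if the link runs flat out, in TB."""
    seconds = days * 24 * 60 * 60
    gigabytes = rate_gbps / 8 * seconds  # 8 bits per byte
    return gigabytes / 1000


def hours_to_reach_cap(rate_gbps, cap_tb):
    """Hours of full-speed transfer needed to use up a cap of `cap_tb` TB."""
    gigabytes_per_second = rate_gbps / 8
    return cap_tb * 1000 / gigabytes_per_second / 3600


full_month = monthly_volume_tb(1)         # gigabit link, 30 days
hours_to_cap = hours_to_reach_cap(1, 1)   # gigabit link, hypothetical 1 TB cap
```

Under those assumptions a gigabit link can move 324 TB in a 30 day month, and a 1 TB cap would be gone after a bit over two hours at the advertised speed.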

Max-P@lemmy.max-p.me · 1 month ago

Why does the government keep trying to regulate fake Internet money? The whole point of it was that it was a free-for-all. Who the fuck cares if crypto bros get fucked? If you want real securities you go to a real bank and open a real investment account.

Max-P@lemmy.max-p.me · 1 month ago

The data set is paywalled so it’s hard to know. If they picked shovelware that most people would rather pirate, then yeah, they could reach that conclusion easily.

Denuvo could also just be making people forget about the game once the hype dies down, so they never end up trying it and therefore never end up buying it.

Some people also end up buying the game on sale later, or well after they played it. I personally ended up buying a lot of the games I pirated a while back, well after their release.

Max-P@lemmy.max-p.me · 1 month ago

You have to keep in mind, when you write JavaScript, there’s an entire runtime written in C++ to run it under the hood, with some crazy optimizations to make it reasonably performant. What type of language do you use to write that runtime? A systems programming language like Rust or C++.

You don’t have to use Rust if you don’t like it. Not everything must be written in Rust. Picking a language also involves picking your tradeoffs. Picking an interpreted/JIT language for speed of development is a perfectly valid tradeoff, but not one you can universally make. Sometimes the performance cost becomes really expensive in actual money, and you can save thousands of dollars on server costs by simply having a more efficient application that only needs a fraction of the hardware to run. Even in JavaScript, a fair chunk of the libraries you use end up calling into native C++ code because it would be too slow in pure JavaScript. Sometimes the tradeoff is to pick the popular language so it’s easier and cheaper to hire for.

Even at the dawn of time, most computers shipped with a variant of BASIC so people could write simple applications easily. But if you wanted to squeeze out every bit of power in your Apple II or C64, you sure did reach for assembly. Assembly sucks so we made C, then C++. Rust is still a language that’s made to eventually compile to assembly/binary and have the same performance as if you wrote it in assembly.

And low spec hardware still exists: the regular Pis have gotten pretty fast, but if you run on an RP2040 then suddenly you’re back in 133MHz dual-core land with pitiful amounts of memory, so you do need to write optimized and fast code for those.

Rust’s type system is actually really, really good. Most of the time, if it compiles it runs. It eliminates a ton of errors other than memory safety: the system is so powerful you can straight up make invalid state unrepresentable. You can’t forget to close a connection, you can’t pass the wrong data, you can’t forget to unlock a lock. It does a lot more to enforce correctness of a program well beyond memory safety.

Max-P@lemmy.max-p.me · 1 month ago

I subscribe to a few more communities and my DB dump is about 3GB plain text, but same story, box sits at 5-15% most of the time.

Max-P@lemmy.max-p.me · 1 month ago

A few woes at the beginning but it’s been running smoothly since. If you have experience setting up stuff in Docker and exposing it to the Internet over HTTPS, it pretty much just works.

Max-P@lemmy.max-p.me · 1 month ago

I had to block ByteSpider at work because it can’t even parse HTML correctly and just hammers the same page, sometimes accounting for 80% of the traffic hitting a customer’s site and taking it down.
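For anyone wanting to do the same, a minimal sketch of user agent based blocking as a WSGI middleware. This is not the setup used at work, just an illustration: the names `BLOCKED_UA_TOKENS`, `is_blocked` and `block_bots` are made up, and it only helps while the bot keeps identifying itself in its User-Agent header:

```python
# Lowercase substrings to look for in the User-Agent header (assumed list).
BLOCKED_UA_TOKENS = ("bytespider",)


def is_blocked(user_agent):
    """True if the User-Agent contains any blocked token, case-insensitively."""
    ua = (user_agent or "").lower()
    return any(token in ua for token in BLOCKED_UA_TOKENS)


def block_bots(app):
    """Wrap a WSGI app so blocked user agents get a 403 before the app runs."""
    def middleware(environ, start_response):
        if is_blocked(environ.get("HTTP_USER_AGENT")):
            start_response("403 Forbidden", [("Content-Type", "text/plain")])
            return [b"Forbidden\n"]
        return app(environ, start_response)
    return middleware
```

Matching on the header rather than the source address means it does not matter how many IPs the requests are spread across, and the rejected request never reaches the expensive uncached pages.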

The big problem with AI scrapers is unlike Google and traditional search engines, they just scrape so aggressively. Even if it’s all GETs, they hit years old content that’s not cached and use up the majority of the CPU time on the web servers.

Scraping is okay, using up a whole 8 vCPU instance for days to feed AI models is not. They even actively use dozens of IPs to bypass the rate limits too, so they’re basically DDoS’ing whoever they scrape with no fucks given. I’ve been woken up by the pager way too often due to ByteSpider.

My next step is rewriting all the content with GPT-2 and serving it to bots so their models collapse.

Max-P@lemmy.max-p.me · 1 month ago

That’s pretty much why I made my own instance: nobody can take it away from me. I can ban whichever instance I deem hostile or don’t want content from. Nobody’s taking away my API anymore or shoving ads in my face.

Nobody can pull a Reddit or Twitter on the fediverse, there will always be alternative instances to use putting pressure on the big ones to not drive away people.

Max-P@lemmy.max-p.me · 1 month ago

The log seems to indicate issues with scanning, which could maybe be caused by too many APs around. I believe I may have experienced something similar in a mall briefly.

Does turning off WiFi help? Like full-on airplane mode. Also make sure to disable WiFi scanning when WiFi is off, as that remains on by default for location services. You want to kill WiFi scanning completely.

Max-P@lemmy.max-p.me · 1 month ago

“Telegram was built to protect activists and ordinary people from corrupt governments and corporations – we do not allow criminals to abuse our platform to evade justice.”

So who gets to pick what’s a lawful request and criminal activity? It’s criminal in some states to seek an abortion or help with an abortion, so would they hand out the IPs of those “criminals”? Because depending on who you ask some will tell you they’re basically murderers. And that’s just one example.

Good privacy apps have nothing to hand out to any government, like Signal.

Max-P@lemmy.max-p.me · 1 month ago

Because AT&T doesn’t have confusing branding such as the whole 5GE thing, which is really just them catching up with the 4G+ that everyone else already had, but totally not meant to trick users into thinking they’re getting 5G.

Max-P@lemmy.max-p.me · 2 months ago

I’ll take the autotools over Gradle, that’s how much it sucks.