Im a grown up, I have self preservation instincts. Only indexing opt in accounts and only for limited time so the angry mob won’t burn me.
Im a grown up, I have self preservation instincts. Only indexing opt in accounts and only for limited time so the angry mob won’t burn me.
I make one for web dev and mastodon.
Yeah. For major things. For trivial stuff like choosing lunch place people usually do public voting.
Here’s my way of doing it. TLDR: LUKS with a encryption key hosted in my router
https://nowicki.io/self-hosting-lvm-raid1-with-key-over-ftp/
Holy f… I thought you’re joking but yes tar is indeed a tape archiver
Shameless self promo: I was upset by this as well so I’m working now on a curated search engine just for anything related to webdev. It focuses on blogs and docs. No BS, just high quality sources.
Also it’s hosted on a PC in my living room ;)
I keep my drives encrypted with a key currently hosted in my router hoping they wouldn’t steal that. I’m thinking of actually putting it to cloud so I can disable it remotely.
It was quite a ride to make everything work and I made a blog post explaining it so I remember what I did.
https://nowicki.io/self-hosting-lvm-raid1-with-key-over-ftp/
So it was DNS?
Thanks for the kind words!
Thanks but don’t expect too much yet. Many sources are still missing. If you notice something should be there but it’s not even being crawled feel free to reach me one Mastodon or add it directly via PR here: https://github.com/Kukei-eu/spider/blob/main/index-sources.js
Another person in real life told the same. Adding to the backlog!
I’m on iPhone 12 mini. I love that small design and I strongly believe phones should be small.
Thanks for the good words! Highly appreciate it!
Same for Reddit but here I have mixed feelings about it in general and hope it’s going to die soon being replaced by amazing Lemmy communities.
I also used to type some question and end with “reddit” in Google to get good quality content, but here with kukei the experiment is whether blogosphere can replace it properly when index is promoting it.
This is my main thing. To promote good quality blogs that I tried to follow via RSS but somehow never did. Having them all indexed (and more, some Mastodon community gave me amazing links to index) makes me actually visit them often.
For the “SEO cancer” that where curation comes into play. Before crawling I check unknown blogs to me and decide whether something goes in or not.
Great ideas. For the source code I’m not sure but I’ll put it to the backlog of cool things I get from Lemmy and work on them one by one. Thanks!
The crawler takes only the sources that are defined in the crawler repo (it’s open source, check the github org or kukei-spider).
So in this way it’s “curated” in a sense that it would not add anything else to the index.
Thx for the comments. I’ll fix the mobile view and will definitely redesign it all a bit over weekend. I see a lot of room for improvements.
Also will check how to submit it to Lenses. Highly appreciate it!
EDIT: mobile view is fixed, also did some small adjustments in the whitespaces between result items.
Good idea. I had this thought once to do some narrow indexing of websites, e.g. stack overflow is a big issue, indexing all of this is crazy, picking up some specific tags on the other hand feels like tons of work. In the end I adjust the whole project as it grows with hope that after every tuning it gets better.
As long as I have fun with it I’ll continue :D
For ?? I guess it already has a decent results. I’ll periodically check those kind of cases once the index gets more languages.
Thanks! If you have some suggestions in the future I’m always open to hear
Punycode would work here better I think as it’s plain ASCI with no special characters except a dash if I recall correctly.