Admin of the Bestiverse
Japan already passed a law that explicitly allows training on copyrighted material. And many other countries just wouldn’t care. So if it becomes a real problem the companies will just move.
I think they need to figure out a middle ground where we can extract value from the for profit AI companies but not actually restrict the competition.
I don’t think they’re wrong in saying that if they aren’t allowed to train on copyrighted works then they will fall behind. Maybe I missed it in the article, but Japan for example has that exact law (use of copyright to train generative AI is allowed).
Personally I think we need to give them somewhat of an out by letting them do it but then taxing the fuck out of the resulting product. “You can use copyrighted works for training but then 50% of your profits are taxed”. Basically a recognition that the sum of all copyrighted works is a societal good and not just an individual copyright holders.
I can’t help, just chiming in to say that I’ve also had that experience with Immich. It’s the one service I’ve used that has somehow managed to break itself multiple times like this.
No idea how it happens, I don’t do anything weird with the setup and it just breaks. I’d heard that feedback from other people too but didn’t believe it until it happened to me. It’s been a few months so maybe I’ll try again, I’m just not too happy importing hundreds of gigs of photos multiple times.
So yea just… you’re not alone, good luck.
~15k lines of actual Rust code.
undefined
@ ❯ git clone https://github.com/torvalds/linux && cd linux && tokei Cloning into 'linux'... remote: Enumerating objects: 10655741, done. remote: Counting objects: 100% (1067/1067), done. remote: Compressing objects: 100% (208/208), done. remote: Total 10655741 (delta 961), reused 859 (delta 859), pack-reused 10654674 (from 3) Receiving objects: 100% (10655741/10655741), 5.13 GiB | 13.37 MiB/s, done. Resolving deltas: 100% (8681589/8681589), done. Updating files: 100% (87840/87840), done. =============================================================================== Language Files Lines Code Comments Blanks =============================================================================== Alex 2 222 180 0 42 ASN.1 15 656 441 87 128 Assembly 10 5226 4764 0 462 GNU Style Assembly 1336 372898 271937 56600 44361 Autoconf 5 433 377 26 30 Automake 3 31 23 3 5 BASH 59 2029 1368 352 309 C 34961 24854959 18510957 2766479 3577523 C Header 25450 10090846 7834037 1503620 753189 C++ 7 2267 1946 81 240 C++ Header 2 125 59 55 11 CSS 3 295 172 69 54 Device Tree 5582 1744314 1430810 83215 230289 Gherkin (Cucumber) 1 333 199 97 37 Happy 10 6049 5332 0 717 HEX 2 173 173 0 0 INI 2 13 6 5 2 JSON 894 542554 542552 0 2 LD Script 8 377 289 29 59 Makefile 3062 81226 55970 12993 12263 Module-Definition 2 128 113 0 15 Objective-C 1 89 72 0 17 Perl 61 43843 34461 3909 5473 Python 280 84204 66996 5198 12010 RPM Specfile 1 131 111 2 18 ReStructuredText 3672 761388 577410 0 183978 Ruby 1 29 25 0 4 Shell 957 187353 130476 23721 33156 SVG 79 52122 50727 1303 92 SWIG 1 252 154 27 71 TeX 1 234 155 73 6 Plain Text 1455 134747 0 110453 24294 TOML 3 47 28 12 7 Unreal Script 5 671 415 158 98 Apache Velocity 1 15 15 0 0 Vim script 1 42 33 6 3 XSL 10 200 122 52 26 XML 24 22177 19862 1349 966 YAML 4545 512759 417504 19285 75970 ------------------------------------------------------------------------------- HTML 2 28 22 3 3 |- JavaScript 1 7 7 0 0 (Total) 35 29 3 3 ------------------------------------------------------------------------------- Markdown 1 248 0 177 71 |- BASH 1 2 2 0 0 |- C 1 20 12 6 2 (Total) 270 14 183 73 ------------------------------------------------------------------------------- Rust 91 15207 11065 2248 1894 |- Markdown 85 7773 747 5253 1773 (Total) 22980 11812 7501 3667 =============================================================================== Total 82608 39520940 29971358 4591687 4957895 ===============================================================================
Because most people do not understand what this technology is, and attribute far too much control over the generated text to the creators. If Copilot generates the text “Trans people don’t exist”, and Microsoft doesn’t immediately address it, a huge portion of people will understand that to mean “Microsoft doesn’t think trans people exist”.
Insert whatever other politically incorrect or harmful statement you prefer.
Those sorts of problems aren’t easily fixable without manual blocks. You can train the models with a “value” system where they censor themselves but that still will be imperfect and they can still generate politically incorrect text.
IIRC some providers support 2 separate endpoints where one is raw access to the model without filtering and one is with filtering and censoring. Copilot, as a heavily branded end user product, obviously needs to be filtered.
I understand why they need to implement these blocks, but they seem to always be implemented without any way to workaround them. I hit a similar breakage using Cody (another AI assistant) which made a couple of my repositories unusable with it. https://jackson.dev/post/cody-hates-reset/
This showed up on HN recently. Several people who wrote web crawlers pointed out that this won’t even come close to working except on terribly written crawlers. Most just limit the number of pages crawled per domain based on popularity of the domain. So they’ll index all of Wikipedia but they definitely won’t crawl all 1 million pages of your unranked website expecting to find quality content.
I chose FSL-MIT for my latest project that I plan to run as a service: https://fsl.software/
It's not technically OSS, but it is exactly what I want from a license. Users can do anything they want except make money off it themselves, but 2 years after release the software converts to MIT so you can make money off an old version of the software if you wanted. Basically I as the dev/maintainer get a 2 year lead on selling it as SaaS, and if you want to make money off of the latest versions we need to negotiate a different license agreement.
I think it's a good balance between being open source but also ensuring that development actually has a viable funding route.
I am up to speed on this little drama, but it’s still unclear to me what they’re suing over.
Yea, Honey effectively took over affiliate links. And yes, they were obviously shady (I never used it, because I did not know how they made money). But I don’t quite understand how other people trying to make money from affiliate links have a real claim against them.
Or is this just a case of the influencers realizing they have the moral high ground and the public’s ear, and wanting a pay out?
This was a potential explanation as to why Bezos did that https://lemmy.haley.io/post/1058450
This article is so weird. “We outperformed our competitors because Intel improved the performance of the AWS owned cryptography library we use.”
So like…what did you guys do? Don’t get me wrong, I’m not saying that you did nothing, but your career would go better if you put it in the article that is specifically for bragging about your accomplishment.
How much money do you donate to your ad-free lemmy instance? Or the rest of the free services you’re using?
For the vast majority of people, that number is $0.
Are you willing to bet the stability of an entire language's dependency ecosystem on that? Just so that we can write "crates.io" instead of "crates.rust-lang.org"?
That's really the question. I do agree that there's almost no chance it goes away as too many places and too much money depends on it.
I doubt they will too, but it's still dumb that an entire package ecosystem now has to hope that ICANN will make another exception and special case .io
ICANN tried to phase out .su, the only reason they didn't was because Russia was big enough to tell them no.
Forgot to mention .sh, which is also a ccTLD for a tiny island nation, and also shouldn't be used for hosting anything that is difficult to move.

Does crates.io have a backup plan?
It's possible that the .io cctld is going to go away [0]. Does crates.io have a backup plan at all? Does anyone know what problems it would end up causing?
I imagine the package registry having to move domains is going to cause a ton of problems.
Frankly, it's concerning to me that so much of the Rust ecosystem has chosen to standardize on shaky ccTLDs. The Indian Ocean Territory (.io) is a small island territory whose only inhabitants are a single military base, it is crazy to use that domain for something important. Serbia (.rs) is more stable, but they could still cut off access for non-Serbians if they wanted to.
The 3DS has a screen that size?
Edit: I’m dumb, I made this comment thinking this alarm clock was a watch…somehow. It definitely looks like the sameish size screen
On the feature side, according to Mastodons recent 4.3 release post development is only 4 full time employees and a budget of under $500k annually. That is basically nothing in the realm of social media companies.
Improving Mastodons features requires money and resources, but Mastodons users are unwilling to pay for instances and unwillingly to fund development. Hell, the .world folks host a bunch of instances for collectively hundreds of thousands of users and they take in about $1k a month in donations. I’m surprised that even covers hosting costs.
So…it’s no wonder that it isn’t going to be as polished as other social media in ways that would reduce the attrition.
That’s cool. Personally I just integrated it into my normal chat client by connecting Aichat, which supports a ton of backends including Ollama and hosted options, with Matrix.
Blog post with more info https://jackson.dev/post/chaz/
PieFed has implemented Topics, which are groups of communities maintained by the instance admin. I think they plan to make topics per user at some point.
Meh, just run several associated services and keep the same username on all of them. Nothing is interoperable, stop trying to force it. And a rogue app with bad user data handling practices is still going to leak your data, even if you store your copy of the data securely.
My fediverse accounts are always "patrick@
<service>
.bestiver.se". I currently am only running Mastodon/Lemmy and a few supporting services (e.g. a link manager - https://bestiver.se/@patrick), but I'm adding more as I get to them. Pixelfed, Peertube, Loops(?), Piefed...Adopting this ActivityPods thing looks like it will require each Fediverse project to make what I'd guess are fairly significant changes to their user data handling, and none of those projects are properly funded for this. In fact what this actually seems to be doing is asking every other Fedi app to build on top of their user data API.
I applaud the attempt at building a new standard in the Fediverse, but I doubt it's going to happen.