GitHub account and ik_llama.cpp are down?!
Heya ik, just went to git fetch upstream and noticed something is up??! It seemed a bit unusual given just moments before you were asking about a surge in GH stars... So not sure what is going on...
I'm trying to find your e-mail to reach out, I'm leimgrub - at - gmail - dot - com if you want to connect as I'm happy to help however I can.
If this was on purpose though also zero pressure, everything in life is gravy and this has been a fun ride. However if this is not on purpose then yes I'd like to assist in getting things set back up.
Cheers and much love! I have visited Leipzig myself, it is a lovely city, and my old high school exchange-student friend still lives there. What a curious world in which we live.
-John "ubergarm"
Last screenshot before I refreshed my browser:
People contacted me on my YouTube channel for it. It's so unfortunate!!!! What happened here!!?!
Okay he replied to my email, he seems okay and it was not on purpose. See this reddit thread with screenshot of his email: https://www.reddit.com/r/LocalLLaMA/comments/1m4vw29/ikllamacpp_repository_gone_or_it_is_only_me/
please let folks know on your YT channel
there are at least some forks still available for now e.g. https://github.com/ubergarm/ik_llama.cpp/
Strange, here's the reinstatement form if needed: https://support.github.com/contact/reinstatement
Or if he can't login at all: https://support.github.com/contact/cannot_sign_in
Hey, I scraped all the data from the repo yesterday, all issues, discussions, pr, comments. Asset links are still up. I reached Iwan before sharing just in case. The data is safe!
Sorry to deceive, but upon reviewing the data, the PRs and their bodies are saved, but their comments are missing. The rest seems to be intact. I guess we could start digging through emails to restore those, or scrape indexing engine caches. The GitHub repo will be made open for contributions tomorrow if there are no objections from Iwan.
I guess we could start digging through emails to restore those, or scrape indexing engine caches.
Emails don't contain edits which matter for a lot of the comments. (Emails also have the downside of missing your own content).
The GitHub repo will be made open for contributions tomorrow if there are no objections from Iwan.
I'm a bit confused what you mean by this. Are you planning a fork that will have an archive of old content? I'm not sure what you mean by open to contributions.
Thank you for scrapping the data. I have submitted a support ticket. Let's first see how that goes.
If my account does not get unsuspended, then Github will definitely not be the place where I want to host my open source work.
Ha, it looks like I have been erased from Github altogether. All my contributions to llama.cpp and llamafile are gone.
Ha, it looks like I have been erased from Github altogether. All my contributions to llama.cpp and llamafile are gone.
Yep. It normally replaces things like that with a ghost, if it was a self account deletion.
I also posted on reddit that you replied here.
Yep. It normally replaces things like that with a ghost, if it was a self account deletion.
But I did not self-delete the account. The account was just suddenly suspended.
But I did not self-delete the account. The account was just suddenly suspended.
Yes which is why it removed all references to your account without replacing them with a ghost.
I've submitted a ticket to GitHub, complaining that I can't access the repo - mostly to generate some pressure or at least get an explanation.
You have my full support,
@ikawrakow
. This kind of thing is so frustrating. Just one more reason to take the extra step and run AI and store data locally.
I'm a bit curious, what did the star count surge to?
Also I'm a bit curious, what did the star count surge to?
The repository had 757 stars I last checked. Suddenly it was getting 20-30 stars per day for the last 2-3 days instead of the usual <5. It was my son who alerted me to this by sending me one of those star history graphs where the star curve has started to look almost vertical. I normally don't pay too much attention to this, but I was checking yesterday after the alert and at the time of the suspension ik_llama.cpp
had received more stars for the day (36) than llama.cpp
(28).
Emails don't contain edits which matter for a lot of the comments. (Emails also have the downside of missing your own content).
But others might have your content. You are right about the edits, though.
I'm a bit confused what you mean by this. Are you planning a fork that will have an archive of old content? I'm not sure what you mean by open to contributions.
Yes, I put together a repo with all the scraped data organized by files, and made it human readable with markdown. The missing PRs comments could be contributed by the community for those who still have the relevant emails, but as you pointed out, it's not ideal.
Thank you for scrapping the data. I have submitted a support ticket. Let's first see how that goes.
No problem, I'll wait until then. I sent you the data by email.
So, I got this response from GitHub:
Hi,
Thank you for contacting GitHub Support.
For security reasons, we’ll need you to reach out to us from the email address associated with the account.
Please log out of your current GitHub session and open a new request using the form here (you won't be required to sign in):
https://support.github.com/contact/cannot_sign_in
Once we have a request from the verified email address, we'll certainly do our best to help!
Warm regards,
Did that, but that just created a new ticket.
They take a long time to respond to tickets if you weren't a paying member of github. A month or more. It might be sensible to move to a different git service in the short term. Their suspensions are usually automated and their field of fvcks is barren, even when you're relatively famous.
What is currently considered to be the best alternative to Github?
What is currently considered to be the best alternative to Github?
I am currently scraping GH archive to assemble the missing pieces. If you self-host a GitLab instance, we could reconstruct the old repo there. But it may be preferable to use the main instance or other alternatives that are managed by a real team, which may be simpler for you.
I don't want the hassle of self-hosting. This is something I may have done 20 years ago. Also, the only time I ever got hacked in my quite long experience with computers was by self-hosting a GitLab instance at a $former_job.
Just emailed [email protected] and legal‑[email protected] from my work address, asking for an urgent review and stressing that ik_llama.cpp is "a critical dependency" for some government projects I'm on - not entirely untrue. Based on past dealings with Microsoft, I'm not holding my breath, but it's one more nudge...
Echoing @mcm07 's suggestion, Codeberg seems like a perfect fit, it's a community-driven, fully hosted service, but run as a non-profit dedicated exclusively to FOSS. It's built on the Gitea software. You get the stability and core features (pull requests, issues, releases) without the corporate overhead or the hassle of managing your own instance.
Self hosting may be a PIA!
Would you also consider starting a discord channel or a reddit community or something where we can have a communication? It appears that two disconnected/independent services are needed going forward. I'm glad we can chat about it over here at hugging face tho.
I created a support ticket on my side. My ticket(s) in the past was generally resolved in 1-2 days. Let's hope that they will restore the access very soon.
Pain in the Ass. Discord asks for phone # on signup. Its a bit of a walled garden. To me it's like making a facebook group but for gen z.
A discord channel would be nice! I think members from the community can help you build one.
Discord also provides webhooks and easy-to-use SDK / API which make it to be very extensible.
Yeah agree with @mcm07 codeberg is a nice alternative to GitHub.
Completely disagree with discord though, like @Lockout said it's basically a walled garden. I'm guessing that a lot of people that are into local AI models care at least somewhat about privacy. Last time I checked some users on discord were being asked to verify with a picture of themselves. And that is in addition to the phone number already required upon signup.
Edit: oh it gets worse, it's actually a face scan of yourself or government ID
Well, if the issue is not resolved by tomorrow morning, I will create a repository elsewhere and put the code there.
@ThomasBaruzier was proposing GitLab, which may allow to recreate the PRs and discussions.
This will not be possible on Codeberg?
@ThomasBaruzier was proposing GitLab, which may allow to recreate the PRs and discussions.
You can only recreate all data if you have direct access to the database, which is a feature exclusive to a self-hosted instance.
For non-self-hosted platforms, a middle-ground solution would be to restore historical data using the official APIs provided by the hosting service under a single "dummy" account. Each reconstructed message would provide a small header containing metadata such as the author and timestamp.
While both GitLab and Codeberg support issues and pull requests, I’m unsure about discussions.
What is currently considered to be the best alternative to Github?
How about huggingface :thinking:
While migrating the repository—including issues and pull requests—to GitLab is a straightforward process; the user adoption and community engagement on a different platform might not be the same. While not ideal, one approach might be to continue hosting the project on GitHub to preserve community interaction, while maintaining a mirrored copy on a secondary platform for redundancy.
Some more folks trying their best to show support, thanks Wendell! https://x.com/tekwendell/status/1947409487603896483
Quick update for GH archive scraping. Since the service doesn't offer filtering, you have to download every event on the GitHub API, per hour, and work it from there, so it took a bit of time. The results are promising. I'm seeing messages sent a few hours before the takedown.
The output is a 43MB JSON file, and 4858 events have been collected, including:
CommitCommentEvent
CreateEvent
DeleteEvent
ForkEvent
GollumEvent
IssueCommentEvent
IssuesEvent
MemberEvent
PullRequestEvent
PullRequestReviewCommentEvent
PullRequestReviewEvent
PushEvent
WatchEvent
Let's see what platform Iwan decides on tomorrow. But basically, Codeberg is FOSS and community-driven, while GitLab offers a larger feature set / is more popular. Both are good options.
Hey
@ikawrakow
, hey folks!
I just saw what happened. No comment. O_o
Hope it will be resolved soon with GitHub, because the conversations were precious, beyond the code itself.
Crossed fingers, and if to no avail, let's move elsewhere otherwise.
By the way for people not aware. I am saood06 from the github.
The output is a 43MB JSON file, and 4858 events have been collected, including:
I'm not sure how much that is considering there ~3800 commits on main before it was removed (and ~500 issues/PR).
I also have never done Discord. Is this something people would appreciate?
I also very much disagree with the idea of a discord. Besides the account/privacy aspect it is not a good place to hold information.
While both GitLab and Codeberg support issues and pull requests, I’m unsure about discussions.
I don't think either of them do.
I can't believe they just killed your account like that! Not even locked or private-only where you can retrieve your data, just gone?!
A few thoughts here:
I wouldn't self-host src. The chance of this happening again is quite low IMO (github probably screwed up), and managing the infrastructure / patches, etc is tedious and error prone.
Personally I'd go with gitlab or bitbucket, etc.Regardless, it's also worth having a mirror with another provider (this was an eye opener for me!) I don't know if there are any services/tools to duplicate discussions across, but at the very least, a git-level mirror of the code here on huggingface.
I haven't run a project like this before, but I see HF has discussions and pull requests + all the standard git features. And everyone using this project would have a HF account.
And given it's an ML platform, the staff participate / probably love projects like this, could it work? Or does it lack features you'd need?
One issue with this idea is uptime though. At least a few times per month I encounter issues pulling/pushing datasets/models.
https://status.huggingface.co/ doesn't actually capture all of these.
Anyway, I hope it gets reinstated as I was (slowly) learning a lot reading the discussions there in my spare time.
I'm not sure how much that is considering there ~3800 commits on main before it was removed (and ~500 issues/PR).
This is more about GitHub than git. The scraping captured all text sent to GitHub that wasn't in the .git folder of the repository. Besides, I scraped the entire repo before the takedown directly from the official GitHub API, but it failed to capture PR comments, which seem available in GH Archive.
All that to say that I'm pretty confident we have everything.
So, I have added ik_llama.cpp
to Codeberg: https://codeberg.org/ikawrakow/ik_llama.cpp
It does not need to be its final home, but I wanted to test what happens after git push --all codeberg
. All commits and branches are there but PRs, discussions, Wiki, issues are of course missing.
HF was proposed as a possible option to host ik_llama.cpp
. I think it is best to ask
@ngxson
if this is a viable option.
@ubergarm 's friend Wendell offered to provide a computer for a self-hosted Gitlab instance. This has the advantage of being able to import a lot of the Github add ons, but for me it is only viable if there is a committed volunteer to take care of administration and backups.
Github replied my this morning, but unfortunately they cannot help much:
It appears there may be an issue with your friend's account. However, for privacy and security reasons, I'm only able to discuss account-specific details with the account holder directly.
If they reach out to us through our Account support form, we'll be happy to assist further.
Re. HF as a hosting option, I think it's possible but may not be an optimal option. The experience for reviewing big code files is still not very good.
I think for now, we can temporary go with either codeberg or gitlab. The main problem will be difficulty to setup and sync CI workflows from upstream llama.cpp
Thank you for trying.
I have obviously reached out to them via their support form. "It appears there may be an issue" and no notification about the issue or the suspension? This is starting to look really weird.
The ticket ID is 3586176 in case that helps.
They don't email. They often shadowban. I found out from a popup on one of the pages. In my case it looked like everything was fine when I was logged in. To everyone else it was 404.
Wow, and still no notification! You know, sending emails is really expensive these days with all the inflation going on.
So, what should I learn from this and how should I proceed?
So, what should I learn from this and how should I proceed?
That’s a good question. I think it was shut down by some automated process by mistake - possibly due to an unusual spike in stars, scraping activity, or a complaint from a malicious actor. Then it was reviewed and the action was reversed. What I find horrifying is that they didn’t even bother to consult you or at least inform you of the reason for the takedown.
Personally, the lesson I’ve learned is that you just can’t trust the cloud. It’s fine to use for convenience, but not for critical infrastructure or data.
Nothing to learn, other than GH is unreliable. You just have to wait until they reply. My first GH account died a little bit after microsoft bought them out. I had signed up with a temporary email and used it for a couple years. Suddenly they enabled 2FA to make me more "secure" and secured me straight out of my account.
They almost kicked me off permanently more recently when they started requiring non-email 2fa some time back. Luckily you could use an authenticator vs a phone number. I'm sure many people just did the text thing. Microsoft gonna microsoft.
How do I download all content, including discussions, PRs, Wiki, issues, etc.?
How do I download all content, including discussions, PRs, Wiki, issues, etc.?
Additionally, migrate to Codeberg can do it for Issues, PRs, Wiki but Discussions seems almost locked on github.
How do I download all content, including discussions, PRs, Wiki, issues, etc.?
PR incoming
It may be better to move to https://github.com/ikawrakow/ik_llama.cpp/discussions/636 for additional comments.
Looks like it's back
Yup, back online again!
@ikawrakow
, glad your account was restored. Hopefully your access was too!
If you ever find out, or feel like sharing, curious about what happened
@ikawrakow , glad your account was restored. Hopefully your access was too!
If you ever find out, or feel like sharing, curious about what happened
Thank you. I did not receive an explanation. Not even a notification that the account has been reenabled (someone else noticed). I guess we will never know.