Sunday, 18 February 2024

Show HN: I built a vector database API on Cloudflare https://bit.ly/49FBDis

Show HN: I built a vector database API on Cloudflare https://bit.ly/49D7PTE February 19, 2024 at 05:24AM

Show HN: Like HN, but for Science https://bit.ly/3usfLbu

Show HN: Like HN, but for Science https://bit.ly/49kDFVt February 18, 2024 at 10:30PM

Show HN: FakeMyDb – Fake data generator for your RDBMS https://bit.ly/4bJhLMW

Show HN: FakeMyDb – Fake data generator for your RDBMS Introducing a fake data generator for your relational database. Are you sick of manually entering data into every darn table in your database before you can test, debug, and deploy? With this tool, all you have to do is click a few buttons and then copy the data to your database! Easy-peasy-lemon-squeezy. https://bit.ly/4bKOa5R February 18, 2024 at 03:47AM
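For readers wondering what "generate fake rows, then copy them into your database" might look like in practice, here is a minimal sketch using only the Python standard library. It is not FakeMyDb's implementation: the function names, the three simplified column types, and the example `users` table are all hypothetical and exist only for illustration.

```python
import random
import string


def fake_value(col_type: str, rng: random.Random) -> str:
    """Return a SQL literal for a simplified, hypothetical column type."""
    if col_type == "int":
        return str(rng.randint(1, 10_000))
    if col_type == "text":
        word = "".join(rng.choices(string.ascii_lowercase, k=8))
        return f"'{word}'"
    if col_type == "bool":
        return rng.choice(["TRUE", "FALSE"])
    raise ValueError(f"unsupported column type: {col_type}")


def fake_inserts(table: str, columns: dict, rows: int, seed: int = 0) -> list:
    """Build `rows` INSERT statements for `table`.

    `columns` maps column name -> simplified type ("int", "text", "bool").
    A fixed seed keeps the generated data reproducible between runs.
    """
    rng = random.Random(seed)
    names = ", ".join(columns)
    statements = []
    for _ in range(rows):
        values = ", ".join(fake_value(t, rng) for t in columns.values())
        statements.append(f"INSERT INTO {table} ({names}) VALUES ({values});")
    return statements


if __name__ == "__main__":
    # Hypothetical table used only for this example.
    schema = {"id": "int", "name": "text", "active": "bool"}
    for statement in fake_inserts("users", schema, rows=3):
        print(statement)
```

The printed statements can be pasted into any SQL client. A real generator would also need to respect primary keys, foreign keys, and column constraints, which this sketch ignores.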

Show HN: I created a free website for downloading YouTube transcripts and subtitles https://bit.ly/3OOzsAX

Show HN: I created a free website for downloading YouTube transcripts and subtitles https://bit.ly/42FE4iN February 18, 2024 at 10:41AM

Show HN: I Built an Open Source API with Insanely Fast Whisper and Fly GPUs https://bit.ly/3UMJPsS

Show HN: I Built an Open Source API with Insanely Fast Whisper and Fly GPUs Hi HN! Since the launch of JigsawStack.com, we've been trying to dive deeper into fully managed AI APIs built and fine-tuned for specific use cases. Audio/video transcription was one of the more basic things, and we wanted the best open source model. At this point that is OpenAI's whisper large v3 model, based on the number of languages it supports and its accuracy. The thing is, the model is huge and requires tons of GPU power to run efficiently at scale. Even OpenAI doesn't provide an API for their best transcription model, only providing whisper v2 at a pretty high price.
I tried running the whisper large v3 model on multiple cloud providers, including Modal.com, Replicate, and Hugging Face's dedicated interface, and it takes a long time to transcribe any content: about 30 mins for 150 mins of audio, and this doesn't include the machine startup time for on-demand GPUs. Keep in mind that at JigsawStack we aim to return any heavy computation in under 25s (or 2 mins for async cases) and any basic computation in under 2s.
While exploring Replicate, I came across this project https://bit.ly/3weFJiY by Vaibhav Srivastav, which optimises the hell out of the whisper large v3 model with a variety of techniques like batching and FlashAttention 2. This reduces computation time by almost 30x; check out the amazing repo for more stats! Open source wins again!!
First, we tried using Replicate's dedicated on-demand GPU service to run this model, but that did not help: the cold startup/boot time of a GPU alone made the benefits of the optimised model pretty useless for our use case. Then we tried Hugging Face and Modal.com and got the same results: with an A100 80GB GPU, we saw an average start-up time of ~2 mins to load the machine and model image. It didn't make sense for us to have an always-on GPU running due to the crazy high cost. At this point, I was inches away from giving up.
The next day I got an email from Fly.io: "Congrats, Yoeven D Khemlani has GPU access!". I had forgotten that Fly started providing GPUs, and I'm a big fan of their infra reliability and ease of deployment. We also run some of our GraphQL servers for JigsawStack on Fly's infra! I quickly picked up some Python and Docker by referring to a bunch of other GitHub repos and Fly's GPU tutorials, then wrote the API layer with the optimised version of whisper large v3 and deployed it on Fly's GPU machines. And wow, the results were pretty amazing: the start-up time of the machine was ~20 seconds on average, compared to ~2 mins on the other providers, with all the performance benefits of the optimised whisper. I've added some more stats in the GitHub repo.
The more interesting thing to me is the cost ↓
Based on 10 mins of audio:
- OpenAI Whisper v2 API -> $0.06/10mins
- Insanely Fast Whisper large v3 API on Fly GPU (Cold startup) -> ~$0.029/10mins
- Insanely Fast Whisper large v3 API on Fly GPU (Warm startup) -> ~$0.011/10mins
(Note: These are rough estimates I did by taking averages after running 5 rounds each)
If you guys want to run this on any other GPU provider, you can, as long as they support Docker. We'll be optimising this more over the next few days specifically for Fly's infrastructure, allowing for globally distributed instances of whisper, and will soon be providing a fully managed API on JigsawStack.com. Stay tuned! https://bit.ly/42GNs5J February 18, 2024 at 09:18AM
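To put the cost figures quoted in the post side by side, here is a small arithmetic sketch. The three prices are taken directly from the post; the helper names are made up for illustration, and scaling linearly with audio length is an assumption (a cold start is more likely a fixed overhead than a per-minute cost, so longer files would probably fare better than this suggests).

```python
# Rough per-10-minute costs quoted in the post (USD).
COST_PER_10_MIN = {
    "OpenAI Whisper v2 API": 0.06,
    "Insanely Fast Whisper large v3 on Fly GPU (cold start)": 0.029,
    "Insanely Fast Whisper large v3 on Fly GPU (warm start)": 0.011,
}


def cost_for(minutes: float, per_10_min: float) -> float:
    """Scale a per-10-minute price to another audio length.

    Assumption: cost grows linearly with audio length.
    """
    return per_10_min * minutes / 10


def times_cheaper(baseline: str, other: str) -> float:
    """Return how many times cheaper `other` is than `baseline`."""
    return COST_PER_10_MIN[baseline] / COST_PER_10_MIN[other]


if __name__ == "__main__":
    baseline = "OpenAI Whisper v2 API"
    for name, price in COST_PER_10_MIN.items():
        ratio = times_cheaper(baseline, name)
        print(f"{name}: ${cost_for(150, price):.3f} per 150 min "
              f"({ratio:.2f}x cheaper than baseline)")
```

On these rough numbers, the Fly deployment comes out at roughly half the price of the OpenAI API from a cold start and roughly a fifth when warm.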

Friday, 16 February 2024

Show HN: Chat-focused RAG with automated memory management https://bit.ly/3wjg6xJ

Show HN: Chat-focused RAG with automated memory management https://bit.ly/3witW3g February 17, 2024 at 06:01AM

Show HN: A Flask-Based Internet Radio Player Built in Hylang https://bit.ly/42H6BnY

Show HN: A Flask-Based Internet Radio Player Built in Hylang I ported this small project to Hylang a few weeks ago. It calls out to mpd and mpc, but the most exciting part is that it is written in Hylang, a Lisp that compiles down to Python. It pairs well with [1]. Be sure to use the direct links. [1] https://bit.ly/49hXI6V https://bit.ly/42KmGth February 16, 2024 at 10:30PM

Thursday, 15 February 2024

Show HN: Magika: AI powered fast and efficient file type identification https://bit.ly/42EsjZV

Show HN: Magika: AI powered fast and efficient file type identification We are very excited to announce the release of Magika, our AI-powered, fast and efficient file type identification lib and tool - https://bit.ly/3uAeFu3 Thanks to its optimized Keras model, large-scale training dataset, and ONNX, Magika massively outperforms other file identification tools while being very fast even on CPU. Magika's Python code and model are open sourced on GitHub: https://bit.ly/42EskwX and we also provide an experimental TFJS-based npm package https://bit.ly/3I1VyfB The team and I hope you will find Magika useful for your own projects. Let us know what you think or if you have any questions! February 16, 2024 at 02:08AM

Show HN: Old-school Unix talk for the web https://bit.ly/48jo1YX

Show HN: Old-school Unix talk for the web Posted 12 years ago, but (a) it's been completely rewritten, and (b) it looks awesomely retro now, instead of just some textarea boxes. We would love feedback and PRs. Please note that if you're testing it "solo", your second browser window should be an incognito window. https://bit.ly/3wjIXBS February 15, 2024 at 10:40PM

Show HN: Erogen AI – Safe, ethical, and accessible romantic AI companionship https://bit.ly/3OHBxyE

Show HN: Erogen AI – Safe, ethical, and accessible romantic AI companionship Like many others on the recent internet, I've found AI companionship to be sort of an attractive idea. At the very least, some element of AI companionship seems inevitable in our daily lives in 5-10 years, whether it's through video games or on our phones / in our eyes and ears, etc. I had been playing around with locally hosted options for AI chatbot companions but found the interfaces clunky, and I was really disheartened by the direction of the content and quality I was seeing on existing sites online. I saw some that were so SFW-handicapped that entire communities sprouted around filter-breaking, making safeguard nullification the standard mode for their users (dangerous) simply to get some affection (which seems unreasonable given how much violence and sex we're exposed to in TV, movies, and online). On the other end of the spectrum, I saw a few sites that were so completely off the rails with their content free-for-all that they could potentially give AI companionship a really bad name. So I challenged myself to create a more polished experience that would be safe, ethical, and accessible to the mainstream. You can find the site at https://bit.ly/3I3gyT8 (nsfw)
I shared this site on Reddit a few months ago asking for feedback on the beta version, and it's been slowly gaining traction. This month I made significant AI speed improvements for site users, added the ability for the AIs to change their avatar images dynamically based on your conversation, and implemented customizable AI voices (still adding options), and I'm ready to unveil the new website to the general public in its more complete form. A few upcoming features I'm excited about include long-term memory and speech-to-text for full voice conversations. I'm also exploring avatar generation on the fly as well as moving avatars. 
The name "Erogen" and the site's content are meant to be very much oriented towards romance, love, and NSFW. I think that safe romantic and adult roleplay with a premium feel will be essential to the success of AI companionship in the long run, so I'm dedicating my efforts towards building around that type of content in the most transparently direct and reasonable way possible. I'm eager to listen to the community. I'd love for folks to try it out for free and provide feedback! I'm proud of how far it's come already (AI has certainly given me coding and content management superpowers), but I'm still improving many things, so please let me know if you run into any issues. https://bit.ly/3I3gyT8 February 15, 2024 at 11:13PM

Wednesday, 14 February 2024

Show HN: IaC to Launch Palworld Servers on Fargate Spot https://bit.ly/3wfjyJI

Show HN: IaC to Launch Palworld Servers on Fargate Spot https://bit.ly/3wjXeP7 February 15, 2024 at 05:47AM

Show HN: NeuralFlow – Visualize the intermediate output of Mistral 7B https://bit.ly/3I2sl4m

Show HN: NeuralFlow – Visualize the intermediate output of Mistral 7B https://bit.ly/3UBuawx February 15, 2024 at 04:29AM

Show HN: Gitlab Meeting Simulator 2024 https://bit.ly/3SYH4n1

Show HN: Gitlab Meeting Simulator 2024 Gitlab's meeting recordings on YouTube have tens of thousands of views by people pretending to work. Now you can appear to be in the meeting using your own webcam. https://bit.ly/3SZSfvS February 15, 2024 at 01:22AM

Tuesday, 13 February 2024

Show HN: Instantly play the game I'm developing directly on its website https://bit.ly/42Bhsjj

Show HN: Instantly play the game I'm developing directly on its website I've been working on Athena Crisis for about two years, and full time for the past 9 months. The game is entirely built from scratch using React and CSS without a game engine. It runs anywhere, including the Steam Deck. You can even use a gamepad on the landing page to play! Previously the landing page had a video of the game but my goal was to always just put the actual game on the website. I merged the landing page into the game's monorepo, added the game's React components, and boom – the video was replaced with a playable version of Athena Crisis. Of course, the real game has tons more features, but the landing page now always runs the exact same code as the actual game – including assets, the AI, and the UI/UX – and it is pushed within 5 minutes as the actual game is being updated live. I frequently talk about the tech behind this game (see this React Summit talk about "How Not to Build a Video Game": https://www.youtube.com/watch?v=m8SmXOTM8Ec ) and I'm planning on open sourcing as much as possible in the future. https://bit.ly/42Lug6U February 14, 2024 at 02:13AM

Show HN: Event finding map-based platform https://bit.ly/3SXx65m

Show HN: Event finding map-based platform Hello there, We're trying to develop a map-based event-finding platform for people to explore nearby events. You can find our MVP at the following URL: https://bit.ly/3SXx6SU . The thing is, we are not sure how to find early adopters. We have lots of ideas for how to develop the platform, but would like to do it with active feedback from users. Some of the feature ideas that we have:
- Built-in resale marketplace for tickets, for people who cannot attend an event
- Users' own event facilitation
- Event live streaming and PPV
- Community building (go together with friends, or, if you have no friends to go with, search for people to go with through the platform)
- Our own built-in navigation system to the events
- Personalized notifications/newsletter for events like the ones you have attended or added to favorites
- VR/AR event attendance
Thanks in advance to those who will check out our platform! :) Best Regards, Kalo https://bit.ly/3SXx7pW February 14, 2024 at 12:48AM

Show HN: Linen.team – A lightweight, thread-first Slack alternative https://bit.ly/49cApLv

Show HN: Linen.team – A lightweight, thread-first Slack alternative Hi HN! I'm Kam, the founder and one of the authors of Linen. Today, we are launching Linen.team ( https://bit.ly/49dZx4M ), a lightweight threaded messaging app for your team.
Modern workplace messaging apps (like Slack) are based on IRC, which is great for small groups but breaks down quickly as it scales: you either get overwhelmed by notifications or you have to turn them all off. Most chat apps have threads tacked on but aren't built from the ground up with this design in mind. We wanted to create a thread-first experience where you can organize and prioritize conversations so that you are not reliant on notifications to make sure you don’t miss anything.
In apps like Slack, you have to check activities, channels, threads, and replies just to make sure you aren't missing anything important. We designed every message in Linen to belong to a thread, which makes it easy to centralize everything in a single location. We let you select which channels you subscribe to from your inbox, so your inbox only has the important channels. This makes it easy to keep track of conversations without relying on notifications.
We also wanted a better way to separate urgent vs. non-urgent communication. In Linen, we have introduced the concept of a !mention that is designed for urgent/time-sensitive messages. A !mention will send a push notification, whereas an @mention will show up in the person’s inbox. This allows us to encourage more async conversations and reduce the number of push notifications. We also designed the mention system closely with the inbox so that even if you aren’t subscribed to channels, mentions will still appear in your inbox. This is great for joining partner teams where you don’t need to view every conversation but do need to respond when you are mentioned. 
We believe that most messaging apps are secretly to-do lists in disguise; you have to read, respond, or do some task when you receive a thread. We wanted to give you the ability to manage threads individually. We let you mark each thread as done, which hides it from your inbox and is useful for keeping track of tasks. You can also set reminders and mute threads with one click/key. With these features, we make it easy to get to an inbox-zero state. Combined with the inbox, this makes it easy for you to keep track of conversations and make sure you don’t miss anything.
Linen is designed for power users. We love keyboard shortcuts and want an experience that is keyboard-first. For many, the messaging app is the app that is used the most. We believe that you should be able to use Linen for an entire day without touching the mouse. We’ve added modern features like CMD+K for navigation.
We’ve designed Linen to be fast and lightweight. Our gzipped bundle size is 400KB, so it's fast on first load, and we’ve introduced multiple layers of caching to make sure things are fast on subsequent loads.
We’ve been working hard on this app for the past 6 months, so there are still gaps in the platform. But we’re also very excited about the direction we can take. Our focus is on what a modern messaging platform built in 2024 should look like and what lessons we can take from the previous decades of IRC and messaging apps. If our message resonates with you, we would love for you to give us a try at https://bit.ly/49cAqPz , where you can join our public community and come say hi! February 13, 2024 at 03:31PM
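The !mention vs. @mention split described in the post can be pictured with a short sketch. This is not Linen's code: the regular expression, the `Routing` class, and `route_mentions` are hypothetical names, and the exact mention syntax is assumed. The post does not say whether a !mention also lands in the inbox, so this sketch only records the push notification for it.

```python
import re
from dataclasses import dataclass, field

# Assumed syntax: "!name" marks an urgent mention, "@name" a non-urgent one.
# The lookbehind skips things like "a@b.com" or "great!thanks".
MENTION_RE = re.compile(r"(?<!\w)([!@])(\w+)")


@dataclass
class Routing:
    push: set = field(default_factory=set)   # users who get a push notification
    inbox: set = field(default_factory=set)  # users who only get an inbox entry


def route_mentions(message: str) -> Routing:
    """Sort mentioned users into push vs. inbox delivery."""
    routing = Routing()
    for sigil, user in MENTION_RE.findall(message):
        if sigil == "!":
            routing.push.add(user)
        else:
            routing.inbox.add(user)
    return routing


if __name__ == "__main__":
    result = route_mentions("Deploy is broken !kam, fyi @sara")
    print("push:", result.push)
    print("inbox:", result.inbox)
```

Keeping the urgent channel behind a separate sigil means the sender has to opt in to interrupting someone, which matches the post's stated goal of encouraging async conversation.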

Show HN: Google Sheets as backend for React apps https://bit.ly/49bIHmR

Show HN: Google Sheets as backend for React apps https://bit.ly/42BcpiU February 13, 2024 at 11:18AM

Show HN: Explore a collection of over 5000 Custom GPTs https://bit.ly/3OJjAjb

Show HN: Explore a collection of over 5000 Custom GPTs Hey, HN community. I am launching EpicGPTStore today. I would like to make it the best GPT store out there. Let me know your opinions and feedback on this. ;) https://bit.ly/3UEjQUl February 13, 2024 at 07:57AM

Monday, 12 February 2024

Show HN: Swift Mail, a native macOS app for JMAP mail https://bit.ly/3uyDBSO

Show HN: Swift Mail, a native macOS app for JMAP mail Hello HN! I'm excited to share Swift Mail, a native macOS email client purpose-built for the JMAP mail standard. Primarily constructed with SwiftUI with occasional AppKit elements, Swift Mail combines the speed and efficiency of a modern mail standard with desktop-centric features such as system notifications, keyboard shortcuts, quick look, multiple windows, state restoration, dark mode, and more. Swift Mail distinguishes itself from other email clients with its steadfast commitment to the JMAP standard over the traditional IMAP implementation, facilitating seamless alignment with modern mail features. It supports various innovative Fastmail features, such as multiple sending identities, the ability to send or reply on-the-fly from wildcard (*) aliases, and the ability to swiftly transition between (true) label and folder organization schemes. Swift Mail prioritizes user privacy and does not collect any user data or function through intermediary servers. Instead, it directly connects to the JMAP server with the user's provided account credentials, processing and storing all data locally on the user's device. Currently, Swift Mail is available directly via the Mac App Store with support extending back to Monterey. I’m also running a developer build on visionOS (if you have hardware and are interested in testing a beta release, please reach out to beta at swiftmail dot io). A sincere thank you to everyone who has contributed their valuable insights or participated in beta testing via TestFlight thus far. Looking forward to your feedback! - Karl https://bit.ly/3uAnFzp February 13, 2024 at 03:04AM

Show HN: I built a simple daily budgeting app for me and my wife https://bit.ly/4bDsrwA

Show HN: I built a simple daily budgeting app for me and my wife https://bit.ly/48hjCpB February 12, 2024 at 09:48PM