Friday, 12 February 2021

Show HN: Clubhouse Room Search Engine https://bit.ly/3adKN9O

Show HN: Clubhouse Room Search Engine https://bit.ly/2NnIbx4 February 12, 2021 at 11:06PM

Show HN: Search inside YouTube videos using natural language queries https://bit.ly/3d5QGYv

Show HN: Search inside YouTube videos using natural language queries https://bit.ly/3qiEes6 February 12, 2021 at 10:29PM

Show HN: Check if my startup ideas have legs https://bit.ly/374tPZJ

Show HN: Check if my startup ideas have legs https://bit.ly/3jJyuFu February 12, 2021 at 09:37PM

Show HN: Build a dashboard to monitor all your statuspages in one place https://bit.ly/3agJZkC

Show HN: Build a dashboard to monitor all your statuspages in one place https://bit.ly/3aTkGEc February 12, 2021 at 07:05PM

Show HN: Augmenting tabular data with SDV to improve ML results https://bit.ly/2Zdbp4w

Show HN: Augmenting tabular data with SDV to improve ML results https://bit.ly/3rU8GcN February 12, 2021 at 06:51PM

Show HN: A Python Libary That Simplifies Data Validation https://bit.ly/2NoTl4F

Show HN: A Python Libary That Simplifies Data Validation https://bit.ly/3tT3TKk February 12, 2021 at 05:21PM

Show HN: Marketing Patterns – DIY Template for Growth https://bit.ly/3aYQGqF

Show HN: Marketing Patterns – DIY Template for Growth https://bit.ly/3qi1QNx February 12, 2021 at 06:42PM

Show HN: Minimalist Tor-to-Web Proxy https://bit.ly/3af71bJ

Show HN: Minimalist Tor-to-Web Proxy https://bit.ly/3acjahx February 12, 2021 at 05:52PM

Show HN: DeFiOptions – ERC20, collateralized, cash settable options on ethereum https://bit.ly/2MYn98B

Show HN: DeFiOptions – ERC20, collateralized, cash settable options on ethereum https://bit.ly/2ZbNJO4 February 12, 2021 at 12:50PM

Show HN: Digital Pottery in the Browser https://bit.ly/3rNErny

Show HN: Digital Pottery in the Browser https://bit.ly/3rKspLM February 12, 2021 at 01:05PM

Show HN: Create animated and interactive drawings in the browser https://bit.ly/3pepkSz

Show HN: Create animated and interactive drawings in the browser https://bit.ly/3tRHcWV February 12, 2021 at 01:04PM

Show HN: Zig Xor Filters (“Faster and Smaller Than Bloom Filters”) https://bit.ly/3b33Kv6

Show HN: Zig Xor Filters (“Faster and Smaller Than Bloom Filters”) https://bit.ly/2NnOuRf February 12, 2021 at 09:55AM

Thursday, 11 February 2021

Show HN: I built an URL shortener to share Clubhouse events https://bit.ly/3pdrAcS

Show HN: I built an URL shortener to share Clubhouse events https://bit.ly/3qe8Xa3 February 10, 2021 at 05:34PM

Show HN: Easy Dependency Injection for Golang https://bit.ly/3tQz5tO

Show HN: Easy Dependency Injection for Golang https://bit.ly/3q8xf57 February 12, 2021 at 01:57AM

Show HN: Real-time multiplayer games with cubes. Early feedback on dev docs? https://bit.ly/3rP8Wd1

Show HN: Real-time multiplayer games with cubes. Early feedback on dev docs? https://bit.ly/3tOghLF February 12, 2021 at 12:51AM

Show HN: EffectNode Studio, trying to make complex graphics code more manageable https://bit.ly/375nEES

Show HN: EffectNode Studio, trying to make complex graphics code more manageable https://bit.ly/3addjZu February 11, 2021 at 11:57PM

Show HN: git-peek – git repo to local editor instantly https://bit.ly/3phpbgZ

Show HN: git-peek – git repo to local editor instantly https://bit.ly/2OuR4pn February 11, 2021 at 11:07PM

Show HN: Sleepy Time Conference—conferences that comes together while you sleep https://bit.ly/373ByHe

Show HN: Sleepy Time Conference—conferences that comes together while you sleep https://bit.ly/3pauYVB February 11, 2021 at 10:50PM

Show HN: Nirvana.Work – Automated Task Scheduling – Put Sprints on Autopilot https://bit.ly/3qbUawx

Show HN: Nirvana.Work – Automated Task Scheduling – Put Sprints on Autopilot https://bit.ly/3qdalJU February 11, 2021 at 06:08PM

Launch HN: Cord (YC W21) – training data toolbox for computer vision https://bit.ly/2Z7vuJB

Launch HN: Cord (YC W21) – training data toolbox for computer vision Hey HN community - I’m Ulrik from Cord ( https://bit.ly/3qdaflw ) in the current YC W21 batch [1] - we are building software that allows people to label their data intelligently using a toolbox of various ‘labeling algorithms’. Labeling algorithms are any units of intelligence (e.g. a pre-trained model, or an interpolation algorithm) that help automate the annotation process. This enables data science and machine learning teams to rapidly iterate on their ML models without having to farm out labeling tasks to an external workforce. Today we’re launching the first part of our product, our Web App, which serves our initial set of automation features through a GUI. It also allows you to classify images and draw vector labels, visualize data, and perform collaborative QA. Computer vision ML algorithms are widely used for tasks like detecting everyday objects such as cars and pedestrians. However, they are yet to see widespread adoption for things like detecting cancerous polyps during an endoscopic procedure or blood clots in MRI scans. The lack of massive-scale labeled training datasets that fuel contemporary approaches is often the blocking element in building ML applications that solve these more specialised tasks. We also believe that the core part of the IP of an ML application stems from the labeled data used to train it. Creating these datasets is challenging for several reasons. Labeling the data requires expensive domain-expert annotators, and privacy might prevent the data from being sent to an external workforce. Ultimately most labeling work tends to be done using open-source tools that were not created for speed and purpose-built to handle massive-scale datasets[2]. These tools also tend to provide a poor experience for the end consumer of the training data (e.g., data scientists, ML engineers) because they lack intelligence and require high manual input. The initial seed of the idea came while I was working on a CS master’s project of visualizing massive-scale medical image datasets. I saw saw how much time and effort was being spent by doctors on labeling data. I met my co-founder Eric, who had worked as a quant researcher in finance, and after meeting him we realized we could take an algorithmic approach to tackling the labeling problem. Instead of writing trading algorithms, we turned our focus to writing labeling algorithms. For example, for a food calorie estimation project we translated image level classifications of food items to individualized bounding box labels using a labeling algorithm we wrote with our SDK, requiring only one manual label per food item. Although it was an image dataset, our algorithm approximated noisy bounding box labels by using a CSRT object tracker across images. It then trained a shallow Faster RCNN ‘micro-model’ on the noisy labels, ran inference on the data, and suppressed earlier noisy labels. We then quickly visually reviewed and adjusted the results on our Web App[3]. We have applied a similar approach in areas such as gastroenterology[4] and pathology. The days of relying on an army of human annotators and waiting to start the model building process are hopefully (soon) over. We are incredibly excited to be driving for that change - and are delighted to be sharing Cord with the HN community! We would love to hear your feedback. How are you going about creating and managing training data today? What are your key constraints? If you have used a creative method to label your data before, please share. Thank you so much in advance! [1] What I Learned From My First Month at Y Combinator - https://bit.ly/3a9Osp3... [2] Why You Should Ditch Your In-House Training Data Tools (And Avoid Building Your Own) - https://bit.ly/3d71DZU [3] Label a Dataset with a Few Lines of Code - https://bit.ly/3aV3w9h... [4] Pain Relief for Doctors Labelling Data - https://bit.ly/3b0cbHP... February 11, 2021 at 06:06PM