Outline of a trainable, streaming tokenizer for NLP with Elixir

Patrick Tschorn wrote “Virtually all NLP tasks require some form of tokenization, and in many cases the tokenizers provided by popular NLP libraries are adequate. If, however, the input material strays sufficiently from the norm, the available tokenizers may not be satisfactory and it may turn out that it is nearly impossible or far too costly to adapt…”

Strife: Using OAuth to make a Discord profile page

Tom Parker-Shemilt wrote “I’ve written an app (Strife) to provide Discord profile pages, but I’m surprised no-one had already done so, so I’m going to show you how I did it. I’ve started using Discord recently, and I was surprised that there was no way to link to a users profile. You can search for profiles inside the…”

Docker without Dockerfiles

Dickon Reed wrote “Why we use Docker Sometimes an innovation has such an impact that, when talking to developers, it seems everybody has decided to learn the same thing at about the same time. Maybe that’s Rust, or Haskell, or Ruby, or Python, or Perl, or C++, or C, or Turbo Pascal or BASIC. Maybe it’s the novelty…”

Reading ARFF files with Elixir

Patrick Tschorn wrote “If you are implementing a machine learning approach, you are likely to want to test it on publicly available datasets. A large number of these datasets use the ARFF file format established by Weka. I am not aware of any Elixir ARFF readers, so I am going to explore writing one (‘Arfficionado‘) in this blog.…”

The Terminology of Knowledge Graphs

Alexandra Orth wrote “Terminology is constantly evolving and often confusing. This is especially true in tech with the constant re-defining/re-naming/re-branding/re-implement churn that companies undertake to make their product stand out. A few months ago some questions appeared in a thread regarding the use cases for knowledge graphs. Knowledge graphs and graph databases had a resurgence in popularity the…”

Fixing networking after a Tunnelblick crash

Dickon Reed wrote “If you are running tunnelblick to access a VPN using OpenVPN on a High Sierra Mac, and for instance your whole system crashes, once you’ve logged back in you may lose all network access, even when you have the VPN down. In my case the fix was to open Terminal and run: networksetup -setdnsservers "iPhone…”

Building a GuixSD Vagrant box

Tom Parker-Shemilt wrote “I’ve been curious about the use of declarative mechanisms for creating operating systems for some time. In contrast to most configuration management tools which say certain things that will be true and let everything else do what it likes (particular packages will be installed, particular services in a named state, etc), declarative mechanisms declare the…”

Building rule-based machine learning systems from scratch

Patrick Tschorn wrote “Sometimes, it is obvious that a project needs machine learning, but you can tell that simply pumping the data through all the algorithms in a popular library (and picking the one algorithm that performs least badly) is not the answer. Machine learning libraries cannot cover all algorithms, trade-offs and heuristics specific to arbitrary problem domains.…”

https://www.flickr.com/photos/64318484@N04/6844840040/sizes/o/

Can Great Britain run fully on renewable energy?

Krista Hyer wrote “In Great Britain, electrical power is supplied via the National Grid, which constantly and carefully balances supply against demand. This is a vital and high-stakes process—deviations of more than 1% in the Grid’s conditions can lead to infrastructure damage and power outages due to blown fuses. (Even deviations of less than 1% will have noticeable…”

Lego Telepresence bot: how not to try and build one

Tom Parker-Shemilt wrote “Four years ago, I built a Dalek-based telepresence bot (part 1, part 2), and I’d been idly thinking for some time that what I really needed to do was make a better follow-up, as a much better one could probably be done with Lego Mindstorms especially given the existence of the BrickPi board for interfacing…”

Three books every software developer should read in 2019

Patrick Tschorn wrote “Ok, that title is tongue-in-cheek clickbait, but here are three books which in my opinion are likely to broaden your horizon by offering surprising perspectives on the most powerful tool that you have at your disposal: your mind (and your body). I have found these books to be filled with interesting observations and pieces of…”

Dockerizing Sybase and connecting to it from Elixir

Patrick Tschorn wrote “On a recent project, we were required to connect our software to a Sybase DB on Linux, which we managed through FreeTDS and ODBC. I will summarize the relevant details in this post. Please be aware that a number of alternative solutions are conceivable. I built Docker images for our software, so that I would…”