Reflections on Reaching 1 Million People on Stack Overflow

This week, I have officially reached more than 1 million people on the website stackoverflow.com! I wanted to take a moment to reflect on this “achievement”, what it means for my professional career, and why I simultaneously believe that it is sheer luck (and a LOT of procrastination) that got me here.
As always, if you have any questions or comments, feel free to message me at dennis.aumiller@gmail.com or reach out on Twitter.

Discovery of the New Cohere Summarization Endpoint

Two weeks ago, Cohere.ai announced their new dedicated summarization endpoint! For someone currently doing their PhD on text summarization, this is worrying, but obviously also a rather intriguing development: while recent advances have focused on broadly applicable models (think ChatGPT), providing more task-specific alternatives seems to be the niche that Cohere is carving out for themselves.
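To give an idea of what a task-specific endpoint looks like in practice, here is a minimal sketch of calling it from Python. This assumes the Cohere Python SDK (`pip install cohere`) exposes a summarize method along the lines of the announcement; the parameter values, the sample text, and the placeholder API key are my own assumptions, not a definitive reference.

```python
import cohere

# Minimal sketch, assuming the Cohere Python SDK and a valid API key.
co = cohere.Client("YOUR_API_KEY")  # placeholder, not a real credential

article = (
    "Cohere announced a dedicated summarization endpoint this week. Unlike "
    "general-purpose text generation models, the endpoint is built for a "
    "single task and exposes summarization-specific controls, such as the "
    "target length and how extractive or abstractive the output should be."
)

response = co.summarize(
    text=article,
    length="short",        # rough target length of the summary
    format="paragraph",    # a paragraph instead of bullet points
    extractiveness="low",  # prefer rephrasing over copying sentences verbatim
)
print(response.summary)
```

The appeal of this design is that the hard choices (how long the summary should be, how much it may rephrase) become explicit parameters instead of prompt-engineering tricks.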
Adding to the surprise of seeing a dedicated summarization endpoint is the fact that text summarization is really hard; a lot of progress has been made over the last 50 years, but even current state-of-the-art models still suffer from annoying problems, such as failing to faithfully retain the facts of the input text. Another problem is the actual definition of “summaries” across different domains: methods for generating a “good” summary of a news article are generally useless when it comes to summarizing a court ruling, or generating radiology reports from doctors’ notes. Due to the combination of these (and other) factors, there are comparatively few production settings in which summarization is actively used. To my knowledge, the two main applications using some form of summarization right now are news aggregators, which summarize information from multiple news articles (primarily with extractive methods, meaning they directly copy existing sentences from the input documents), and the recently introduced “Document TL;DR” generator in Google Docs (the latter using a variant of Google’s own PEGASUS neural model).
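As a point of reference for what “extractive” means in practice, here is a toy sketch of the idea (entirely my own illustration, not the method any particular aggregator uses): score each sentence by the frequency of its words and copy the top-scoring sentences verbatim.

```python
import re
from collections import Counter

def extractive_summary(text: str, k: int = 2) -> str:
    """Copy the k highest-scoring sentences verbatim from the input."""
    # Naive sentence splitting on end-of-sentence punctuation.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    # Score sentences by the average document frequency of their words.
    freq = Counter(re.findall(r"\w+", text.lower()))
    def score(sentence: str) -> float:
        tokens = re.findall(r"\w+", sentence.lower())
        return sum(freq[t] for t in tokens) / max(len(tokens), 1)
    top = set(sorted(sentences, key=score, reverse=True)[:k])
    # Emit the selected sentences in their original order.
    return " ".join(s for s in sentences if s in top)
```

Because the output sentences are copied verbatim, factuality is largely preserved, which is exactly why aggregators favor extractive methods, at the cost of fluency across sentence boundaries.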