Assorted links

Archiset - a wonderful set of architectural and movie illustrations. Actually look at nearly everything by Federico Babina.

Bayesianism and Causality, or Why I am only a Half-Bayesian. Great article introducing issues of causality in scientific inference. Most important point - "Causal assumptions, in contrast [to statistical assumptions], cannot be verified even in principle, unless one resorts to experimental control." Another great quote: "The third resistance to causal (vis-a-vis statistical) assumptions stems from their intimidating clarity...assumptions about how variables cause one another are shockingly transparent, and tend therefore to invite counter-arguments and counter-hypotheses." After reading this, I'm inclined to learn more about various attempts at a causal calculus, particularly the Do Calculus.

Ultrametric semantics of reactive programs. This paper describes how to encode causality (the property that a stream function depends only on past values) in the type system. The specific mechanism is an ultrametric space, with a distance function defined roughly as d(stream1, stream2) = pow(2,-n) where n is the first time at which the streams differ.
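To make the distance concrete, here is a minimal sketch in Python, not taken from the paper. The names stream_distance, running_sum and delay are hypothetical, and the finite horizon is an assumption so that the comparison terminates. The idea it tries to illustrate: a causal function cannot push two streams further apart under this distance, and a one-step delay pulls them closer.

```python
from itertools import islice


def stream_distance(xs, ys, horizon=64):
    """Return 2**-n, where n is the first index at which xs and ys differ.

    Streams that agree on the first `horizon` elements are treated as equal
    (distance 0). The horizon is an assumption of this sketch.
    """
    for n, (x, y) in enumerate(islice(zip(xs, ys), horizon)):
        if x != y:
            return 2.0 ** -n
    return 0.0


def running_sum(xs):
    """A causal stream function: output at time t uses inputs up to time t."""
    total = 0
    for x in xs:
        total += x
        yield total


def delay(xs, initial=0):
    """Shift the stream one step into the future, emitting `initial` first."""
    yield initial
    yield from xs


a = [1, 2, 3, 4]
b = [1, 2, 9, 4]  # first differs from a at index 2

d_in = stream_distance(a, b)
d_sum = stream_distance(running_sum(a), running_sum(b))
d_delay = stream_distance(delay(a), delay(b))

print(d_in, d_sum, d_delay)
```

Here running_sum leaves the distance unchanged, while delay halves it, since the first disagreement moves one step later.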

Machine Learning: The High-Interest Credit Card of Technical Debt. This article gives a broad overview of how many ML systems are (unavoidably) poorly engineered and suffer many of the problems of bad code, e.g., unused or weakly used dependencies and unexpected feedback loops. Very important read for anyone building such systems.

Yet again I found myself re-reading Universal Portfolios by Thomas Cover. This describes a portfolio allocation strategy roughly analogous to adversarial bandit algorithms - a prediction-free strategy which should (in the long run) match the growth rate of the best constant rebalanced portfolio chosen in hindsight. The proof is interesting. The strategy's wealth is first shown to equal the average, over all possible constant portfolios, of the wealth each one would have achieved. Then the Laplace method of integration shows that this average grows at the same exponential rate as the best.
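A rough sketch of the averaging idea, for two assets only. This is my own discretized approximation, not Cover's construction: the integral over portfolios is replaced by a uniform grid, and the names crp_wealth, make_grid, universal_weight and run_universal are hypothetical. The market (cash plus an asset that alternately doubles and halves) is an illustrative assumption.

```python
def crp_wealth(b, price_relatives):
    """Wealth of a constant rebalanced portfolio holding fraction b in asset 1."""
    wealth = 1.0
    for x1, x2 in price_relatives:
        wealth *= b * x1 + (1.0 - b) * x2
    return wealth


def make_grid(grid_size=101):
    """Uniform grid on [0, 1], standing in for the integral over portfolios."""
    return [i / (grid_size - 1) for i in range(grid_size)]


def universal_weight(history, grid):
    """Wealth-weighted average of the grid portfolios, using past data only."""
    wealths = [crp_wealth(b, history) for b in grid]
    return sum(b * w for b, w in zip(grid, wealths)) / sum(wealths)


def run_universal(price_relatives, grid):
    """Trade sequentially; each period's weight depends only on earlier periods."""
    wealth = 1.0
    for t, (x1, x2) in enumerate(price_relatives):
        b = universal_weight(price_relatives[:t], grid)
        wealth *= b * x1 + (1.0 - b) * x2
    return wealth


# Assumed toy market: asset 1 is cash, asset 2 alternately doubles and halves.
market = [(1.0, 2.0), (1.0, 0.5)] * 10
grid = make_grid()

universal = run_universal(market, grid)
average = sum(crp_wealth(b, market) for b in grid) / len(grid)
best = max(crp_wealth(b, market) for b in grid)

print(universal, average, best)
```

On the grid, the sequential strategy's wealth equals the plain average of the constant portfolios' wealths (the product telescopes). That average can never exceed the best grid portfolio, and it can trail it by at most a factor of the grid size, which does not grow with time.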

How to understand the drawbacks of K-means

The Big Problem is Medium Data

A key part of statistical thinking is to use additive rather than Boolean models

What effect size would you expect. Discusses what specific measurement should count as a "replication".

Ten lessons learned from building real ML systems.

Spoofers Tricked High-Speed Traders by Hitting Keys Fast

Poor in the US = Rich. A short article with many links, and given that this is from givewell.org, it suggests why resources should be diverted from helping poor/rich Americans to helping actual poor people elsewhere. Also notable: Hunger here vs hunger there.

Apparently Shanley Kane was a great big racist. The horrible weev-kind. Her twitter feed pretty much admits it's true.

How regulators (ab)use bank regulation for non-democratic purposes.

If I ever find myself in a situation where I have a gas oven, I'll be sure to bake a french toast roast.

Gawker publishes an article semi-apologizing for spawning a lynch mob. Well, also defending themselves, now that they are the victim of one.
