wip clipper post

2023-07-20 00:47:18 -07:00 · 2023-07-20 00:47:18 -07:00 · bea2d15b6b
commit bea2d15b6b
parent 3a1d2710d6
1 changed files with 152 additions and 0 deletions
--- a/content/posts/announcing-clipper.md
+++ b/content/posts/announcing-clipper.md
@ -0,0 +1,152 @@
+++
+date = "2023-06-24"
+draft = true
+path = "/blog/announcing-clipper"
+tags = ["clipper", "rust"]
+title = "Announcing Clipper: TLS-transparent HTTP debugging for native apps"
+++
+
+<!-- FIXME: wire_blahaj.jpg -->
+
+Sometimes, I get jealous of web developers for having real network debugging
+tools for watching HTTP traffic. Other times, I decide I can have nice things
+too and spend a few weeks writing a tool.
+
+A while ago I was debugging an issue with the Rust OpenTelemetry library and it
+was immensely frustrated by not being able to get the actual requests out of
+it. Eventually it yielded when I forked `h2` to add some logging that would be
+exceptionally inadmissible in production. This is (probably) not the fault of
+either rust-opentelemetry or the HTTP library that the observability system was
+not observable, but it sure was frustrating.
+
+Perhaps HTTP libraries should have a standard interface for dumping the raw
+requests they perform, but this is going to be filtered through what the
+library *thought* it did, rather than necessarily what it actually did. Plus,
+there's hundreds of HTTP libraries, so the experience would be undoubtedly
+quite variable.
+
+What if there *was* one tool that was universal and could read any HTTP traffic
+with no configuration, regardless of the app? What if debugging a modern native
+app's HTTP traffic (HTTP3 coming soon) could be (almost) as easy as opening dev
+tools in a browser?
+
+If only modern computers had a [Clipper chip] and you happened to be the NSA so
+you could just decrypt everything. Thankfully that is not the case. I suppose
+the next best thing is to have code execution, a binary patching library, and
+insufficient fear of `LD_PRELOAD`.
+
+[Clipper chip]: https://en.wikipedia.org/wiki/Clipper_chip
+
+# What is this thing?
+
+Clipper is a suite of tools for doing TLS-transparent HTTP interception. It
+supports:
+* Unprivileged packet capture that also catches keys, storing to PCAPNG files.
+* Attaching Chrome DevTools to unmodified native processes and viewing HTTP
+  activity.
+* Viewing PCAPNG files in Chrome DevTools.
+* Extracting keys from unmodified applications.
+
+## Wire shark doo doo doo doo doo doo
+
+<!-- FIXME: tls added and removed here slide -->
+
+But it's encrypted! Sadly, "TLS added and removed here" is an architecture of
+the past, and for good reason. However, what if we had the session keys and
+decrypted the TLS so we could read it?
+
+<!-- FIXME: link -->
+It turns out that Wireshark can actually do this, if you have the keys already.
+However, getting the keys is the hard part.
+
+<!-- FIXME: link to SSLKEYLOGFILE spec -->
+Also, it turns out that we *did* collectively standardize getting the keys out
+of TLS, and practically every TLS implementation implements it. Specifically,
+there is a standard format called `SSLKEYLOGFILE`, originally implemented in
+Mozilla NSS, for logging the decryption keys from TLS libraries.
+
+Cool, so we're done? Well, as much as it is implemented in the TLS libraries,
+code changes to client code are required to actually use it. This is probably
+for good reason, since it does break TLS:
+
+- `curl` gates it behind a compile flag that is off by default. Not sure about
+  libcurl.
+- NSS gates it behind a compile flag which is off by default.
+- Firefox only enables the NSS flag on nightly or other dev builds.
+- rustls requires you set a special field on the ClientSettings and
+  ServerSettings structures, which downstream users do not do by default.
+- Go `crypto/tls` requires a similar method to rustls.
+- OpenSSL requires you call `SSL_CTX_set_keylog_callback` when initializing it.
+- Chromium implements the `SSLKEYLOGFILE` environment variable by default, yay.
+
+  <!-- FIXME: link -->
+
+  Alarmingly, there is a bug report of this feature being abused by antivirus
+  vendors to do TLS decryption, leading the Chromium developers to remove the
+  "your traffic is being intercepted" banner. So perhaps everyone else removing
+  it was a good idea.
+
+## Fiddler2, OWASP ZAP, mitmproxy
+
+These are all fine tools and perhaps better suited to this use case. However,
+in order to use any of these proxies, one needs to execute an actual
+woman-in-the-middle attack, which has Consequences:
+
+- Proxy support is required
+- Support for adding trusted CAs is required
+- Key pinning must be disabled
+- It's *possible* that certain HTTP library compatibility bugs may be concealed
+  by virtue of the proxy decoding and reencoding the data.
+
+I would like to look at the *actual* traffic and not put anything relevant in
+the data plane to the greatest extent possible.
+
+# What the hell crimes did you do to achieve *that*, jade
+
+Glad you asked. Numerous!
+
+Let's start at capturing packets. Normally this requires having high privilege
+on Linux in order to be able to bind a `AF_PACKET` socket to capture packets.
+However, `CAP_NET_RAW` privilege is in the eye of the beholder. Thanks to the
+technologies used by unprivileged containers, this is possible and fairly easy.
+
+Specifically, by entering a new user namespace and network namespace, Clipper
+jails processes away from the host network onto a virtual network over which it
+has root due to the user namespace. Then it can open a `AF_PACKET` socket and
+send a handle out of the namespace to the host process over a `unix(7)` socket
+and capture everything going on in the namespace.
+
+However, that's not sufficient, since the inner process cannot actually connect
+to anything from a new network namespace and *host* root would be required to
+attach virtual interfaces to the host network stack. Fortunately, again, the
+missing pieces were built for rootless containers: `slirp4netns` is a userspace
+NAT daemon that NATs a network namespace onto the host network, creating a
+`tap` interface with internet access inside the namespace.
+
+As for getting the keys, either you can patch the binary at compile time
+(boring, may require patching dependencies), or you can patch the binary at
+runtime (fun, indistinguishable from malware behaviour). Since I want a magical
+user experience, runtime patching it is.
+
+<!-- FIXME: link -->
+Some absolute sickos at [mirrord] made a system which allows a local process on
+a developer computer to interact with the network as if it is on another
+machine in a staging environment. This was achieved by using the excellent
+[Frida] GUM Rust bindings, which allow hooking arbitrary code in executables,
+along with `LD_PRELOAD` to get their code running inside processes.
+
+[Frida]: https://frida.re
+
+Clipper also uses Frida GUM. The actual patches to extract keys aren't much:
+- In OpenSSL, hook `SSL_new` to first call `SSL_CTX_set_keylog_callback` on the
+  passed in context.
+- In rustls, they use vtable dispatch over a field set in the client/server
+  settings structure. This seems like a pain in the ass, so we instead hook the
+  no-op key log functions to not be no-ops.
+- In Go `crypto/tls` they use a similar mechanism to rustls and we do the same
+  thing.
+
+Once we have the keys, we can add universal `SSLKEYLOGFILE` support, or send
+the keys over a socket to the capturing Clipper instance. Both of these are
+implemented.
+