Friday, July 5, 2024

Introducing "Quanta" to create a more balanced and valuable media ecosystem

The underlying economics for much of the web is driven by advertising. People generate data through interactions, publishers chase engagement for revenue, advertising platforms generate 100’s of billions of dollars in profit each year. I have been collaborating with Loren Kohnfelder on a number of projects, and in this area we believe that the consequences of this model have profoundly transformed web content and the way it is consumed, sacrificing civility, quality and independence for outrage, quantity and centralization. Let's consider a few different perspectives :

  1. For people, engagement is mostly centered around social networking platforms (Facebook, Instagram, Tiktok, Twitter (or now "X"), Reddit etc) with a typical pattern being to locate content off-site (often from news or entertainment sites), reference and then ‘discuss’ onsite. The push to video (through Tiktok, Instagram Reels, Google showing YouTube video results with high priority) are all a reflection of a consumption culture, with video not only showing a longer time of engagement, but also enabling valuable video ads. All the social networking platforms mentioned above are all ad-based, and so are motivated to generate engagement, measured by key metrics such as on-site time and content interaction (likes, shares, retweets, subscribes etc).
  2. In order for publishers to generate revenue (some subscriptions, but mostly ad based), they are driven to controversy (which drives engagement) and SEO optimization (so that they can appear above other similar content in searches and get picked by the social Algorithm). This even devolves into content copying - a publisher with better SEO optimization can steal content from elsewhere and pass it off as its own, generating more ad revenue. This is particularly rife on Youtube, where the perpetrators will even file DMCA take-downs against the original content producer.
  3. To not risk losing eyeballs, social media platforms generally have a ‘feed’ model of some sort - a news feed or other infinite scroll list of unrelated content. This context-switch heavy model allows for new, small bursts of dopamine to be released by people as they ‘consume’ the content, the net effect being to have a thoughtless consumption of time, and discouragement of deep thinking., but it's also tiring and discourages deep thinking. Studies show that such doomscrolling is detrimental to mental health and focus, though having no content consistency or theme is actually a feature, and following/connecting with people (who contain multitudes) further supports this context switching world.
  4. Since controversy generates more revenue than civility, all advertising platforms have to solve for the user-generated-content problem, surfing the line between what is legal & acceptable in any given jurisdiction and what creates outrage. Good outrage corresponds to impressions, bad outrage leads to advertisers distancing themselves. Whilst there is some reputational risk, and some sites seek to advocate for their communities, the advertising that underwrites the platforms means that content moderation is ultimately for the advertisers's sensibilities.
In summary then, we live an a world of centralized discussion, with constantly shifting focus, all with opportunity for enraged engagement to drive controversy, as it is a proven model for all parties to maximize ad revenue (along with conflict and divisiveness). Ultimately this is a focus on quantity of engagement and impressions, not on the quality of discussion and betterment of people. One further characteristic of this centralized model is that the publishers, who spend time and money to produce the content referred to in the first place, are removed from the discussion beyond the reference to the original article, because news site comments can't compete with social. Sadly, studies show that nearly 60% of people will interact with content without even clicking through.

To solve these inequities and unhealthy dynamics, we asked ourselves the question "What happens if a post isn't just a post? What happens if a post could be both a post, and an in-situ comment to the original article? This is a system we call "Quanta" and here's what it is and how it works.

Rather than continue the pattern of having social networks steal the conversation from publishers, we allow publishers to pull from a social network to host the relevant conversation in-situ, as a side-bar to the main article. Publishers have tried to create comment sections, mostly unsuccessfully, since the audience is poorly qualified/filtered and required per site registration. In Quanta, the audience is brought from the social network, and comments from the relevant audience (filtered by my network as well as using criteria that the publisher can introduce) are shown along side the article - even along side relevant parts of the article.

To explain how Quanta works, let's consider the following example:

Imagine a new article on The New York Times. Today, after a short period of time, the URL will be posted on social media with comments from readers. The conversation (retweets, replies etc) will all occur on the respective social media platforms.

In Quanta, a user would ‘post’ to Mastodon, or any ActvityPub based Fediverse app, quoting the URL, and Quanta would then index and organize such referring posts. In one implementation, the publisher would include comment blocks, and specify constraints/thresholds for the comments to be shown (reflecting popularity and editorial guidelines). The comment blocks would be rendered HTML (similar to how an ad slot is populated), and would be ‘passive’ in the sense of links (to favorite, repost etc) wouldn’t activate in-situ, but would (due to Same Origin Policy) send the user to the right point on the social instance. In another, more interesting, implementation, the browser could form a more opinionated view of content rendering and, with an appropriate social handle logged into the browser, render the comments in a pane next to the article.

In short, viewing the original page produces the current set of useful comments relating to the URL. All comments displayed are contextually relevant, and additionally provide a useful tool for user discovery.

A few interesting consequences of Quanta :

  1. The publisher is now getting many more page views, because that's where the conversation about the page can be readily viewed. This increase in page views will increase the ad revenue the publishers can generate. Note - we don't consider a broader question about advertising being good or bad. We just want to shift revenue, discourse and attention to those that generate the content, away from the social networks that steal that attention.
  2. The 'conversation' is now all in context. It's not about a newsfeed and context switching, but instead the conversation is focused on the topic the publisher has written about.
  3. The duration of the conversation is longer. Doomscrolling a social media feed means that many users are just surfing that last few hours of content that's new to them. On Quanta, the published document and comments (and replies to comments) are all at the same URL, waiting for the next view. Referring to the same document days, months or years later will still have the conversation in place. We believe this will drive greater accountability of the poster and also create further re-engagement for the publisher(*).
  4. Finally, we believe Quanta will provide a great follower discovery feature. All too often on social networks an interesting comment will be amplified, and in the moment the user has to make the decision "should I follow this person?" with little information (or a detour to research). In Quanta, the context is immediate, and publishers can choose to promote comments from known experts, making their work more broadly available. 
Quanta can be implemented as a Quanta server plus a browser extension. As a server, Quanta is an extension of Mastodon, as we believe the index of posts by URL should be distributed so that Quanta itself doesn't because another large centralized system that needs advertising or subscription to succeed. We also note that this seems directionally to be aligned with how Mastodon sees news integration as being important to the platform, as shown by the recently launched Mastodon byline feature.
Quanta has been shared as a concept with key browser vendors, and we firmly believe browser integration will provide the best experience.

 

Full Self Driving Fallacy (aka, ultimately it's a user problem)

 Tesla made the earliest stunning promises of Fully Self Driving cars back in 2016 and you can read a good history of full self driving and Tesla Autopilot on the Wikipedia page.

Technology predictions are notoriously hard, but I'm going to make the case here that whilst fully autonomous (often called level 5) may be possible, the Tesla execution of incrementally adding greater autonomy whilst requiring driver presence and overall responsibility (essentially vehicle autonomy levels 1 through 4) is going to fail. It will fail not for a technical reason, but for a human and social one.

Consider the following incremental 'improvements' to driving automation (I've used bullets rather than numbers so it's not confused with the defined autonomous driving levels)

  • Traffic-Aware Cruise-Control (allows you to set a desired speed but will match with slower vehicles if they’re obstructing)
  • Autosteer (which adds the ability for the car to track within lanes)
  • Navigate on Autopilot (which was introduced in the context of highway driving, getting you from on-ramp to off-ramp, crucially being able to change lanes when the driver indicates).
  • Auto Lane Change (adding the ability to automatically change lanes on highways rather than requiring driver assistance).
  • Full Self-Driving (start to end destination auto driving by the car, with success measured in the fewest number of driver interactions).

There's also some 'point' features like Summon and AutoPark which I'm not going to discuss here. Tesla have some nuances in how these capabilities have changed over time, in particular in relation to the degree to which driver attention is measured. From requiring sensors in the steering wheel to ensure hands are present (easily circumvented) to cameras tracking eyes, Tesla has recognized that in any level of autonomy under level 5, having the driver intervene is important, and therefore they want to be sure the driver is attentive.

Let's just go down the list above and consider what degree of driver attention is needed.
  • For Traffic-Aware Cruise-Control, the driver is actively steering and having to maintain an awareness of the surrounding road, nearby vehicle proximity etc. The driver is also able to pay less attention to their right foot and the pressure it's applying, with speed being kept constant and long freeway driving as well as some stop-go traffic becoming much less tiring.
  • With Autosteer added, the driver is still actively engaged in the driving process, but freed from the immediacy of lane-drift and car following range. The driver has to plan - e.g. should I overtake when my exit is coming up in 2 miles, should I pull out now to overtake, or will the faster moving vehicle behind me have overtaken by the time I reach that point. My assessment would be that driving like this is a nice balance of the car doing the drudge work of ongoing micro-adjustments of lane placement left/right and speed placement in relation to vehicles in the same lange, whilst the driver thinks more strategically.
  • Adding Auto Lane Change in theory means slightly more strategic thinking on behalf of the driver (“Do I feel like a rest stop is a good idea in a few miles, or should I wait another 20?”), but the relative speeds of vehicles, tracking of different kinds of vehicles and speed differentials for overtaking means that the car will invariably not behave as a driver would, e.g.
    • I better speed up a little to get past this truck as it’s doing 63 and I’m set for 65, but there’s a bunch of faster moving cars a mile back that will be stacked behind my long overtake.
    • I better track a little over to the left here as that truck is wide and cutting closer to me than I would like on this curve.
    • I'm going to track a little more to the right to avoid that pothole. 
    • I’m just going to hold back here for a few minutes to let that red mustang I can see way back there get by me, as I don’t want to be involved in any craziness.
What this leads to is a mismatch between the work that the car is doing vs what the driver is doing. That is - the driver is now having to plan for what they expect their car to do in addition to those around them.
  • Full Self-Driving now takes over the lion's share of the driving work, but, crucially, requires the driver to be monitoring everything and able to take over at a moments notice if the car deems an unsafe situation has occurred, or (conversely) if the driver feels that the car is about to perform an unsafe maneuver. The driver’s role has transitioned from being fully engaged, but supported, in the driving process, to one where they are a spectator until they need to fully take over in a challenging situation that the car can’t handle.
My point of stepping through this is essentially to make the argument that once you reach a certain point of driver replacement through automation, only 100% accurate self driving is good enough at that point, since requiring driver re-engagement to troubleshoot in milliseconds is a recipe for disaster.  This seems to be borne out by some of the accident data that is coming out of NHTSA. To save you a click to the article, here's the summary :


"This analysis, conducted before Recall 23V838, indicated that drivers involved in the crashes were not sufficiently engaged in the driving task and that the warnings provided by Autopilot when Autosteer was engaged did not adequately ensure that drivers maintained their attention on the driving task. The drivers were involved in crashes while using Autopilot despite fulfilling Tesla’s pre-recall driver engagement monitoring criteria. Crashes with no or late evasive action attempted by the driver were found across all Tesla hardware versions and crash circumstances."


Unfortunately, if the metrics you measure are safe passenger miles driven by Full Self Driving, the data will lead you astray, as you will have an awful lot of miles well driven by auto-pilot, and a large number of accidents ‘caused’ by drivers after they take over.


So, my contention is that anything under full autonomous level 5 driving is going to skew to this unfortunate requirement of requiring full user attention at precisely a time when the autonomous driving is failing, all the time whilst the semi-autonomy has removed full user attention. Full autonomous level 5 driving may be feasible, since designing for the system to do everything without enabling intervention and will necessarily mean availability in known tested scenarios. There are an awful lot of 'edge' cases with level 5 that will need to be ironed out.