Larry Page’s Bad Memory of YouTube and Google Video

In the Viacom v. Google copyright litigation, Viacom took the deposition of Google co-founder Larry Page.  In his sworn deposition, Page reported remembering very little about YouTube and Google Video.

Viacom remarked in associated briefing: “This Court can decide whether these key executives and witnesses behaved with the level of candor and respect for the legal process that this Court has a right to expect from senior executives of important public companies.”

Page’s full deposition is available in parts 1, 2, and 3.

In the list below, I summarize the substantive questions asked of Page, and quote his verbatim response to each.  By my count, there were 132 distinct instances in which Page could not recall the answer to a question.  (In this count and list, I excluded questions about whether Page received documents, excluded questions about the reasons why he doesn’t remember, and excluded or grouped most repeat questions.)  From my line-by-line review of Page’s deposition:

The 2008 Walker Memo

In recent competition proceedings against Google, multiple courts have discussed whether Google staff were forthright in their remarks (not to mention whether the company preserved and produced documents in accordance with court rules).  See Google Discovery Violations.  Discussions often turn back to a memo sent by Google General Counsel Kent Walker and Bill Coughran (then Google’s SVP of Engineering).  In the dockets for these cases, this document appears in unformatted monospace with broken spacing between paragraphs, and in image form without indexing or searchability.  I am therefore reposting it here in plain text.

Sent: September 16, 2008 2:09 PM
Subject: Business communications in a complicated world
Confidential/Please Do Not forward

Googlers –

As you know, Google continues to be in the midst of several significant legal and regulatory matters, including government reviews of our deal with Yahoo!, various copyright, patent, and trademark lawsuits, and lots of other claims. Given our continuing commitment to developing revolutionary products and doing disruptive things, we’re going to keep facing these kinds of challenges. So we’ve got two requests of you and one change to announce.

First, please write carefully and thoughtfully. We’re an email and instant-messaging culture. We conduct much of our work online. We believe that information is good. But anything you write can become subject to review in legal discovery, misconstrued, or taken out of context, and may be used against you or us in ways you wouldn’t expect. Writing stuff that’s sarcastic, speculative, or not fully informed inevitably creates problems in litigation. In your communications, please avoid stating legal conclusions. Speculation about whether something might breach a complex contract, or whether it might violate a law somewhere in the world, is often wrong and rarely helpful. So please do think twice before you write about hot topics, don’t comment before you have all the facts, and direct questions regarding continuing litigation holds and any legal and/or regulatory matters involving Google to the friendly (albeit lawyerly) folks at [redacted]@google.com.

Second, remember that these same rules apply not just to Gmail but also to Google Talk and all other forms of electronic communication (for example wiki’s, doc’s, spreadsheets, etc.). We end up reviewing millions of pages of these communications as part of producing documents in regulatory and litigation matters — and we’re working together to streamline and simplify that process.

To help avoid inadvertent retention of instant messages, we have decided to make “off the record” the Google corporate default setting for Google Talk. We’ll also be providing this option to our Google Apps enterprise customers. You should see this new default setting taking effect over the next few days. You will still be able to save Talk conversations that are useful to you — but please remember that “on the record” conversations become part of your (more or less) permanent record and are added to Google’s long-term document storehouse. If you’ve received notice that you’re subject to a litigation hold, and you must chat regarding matters covered by that hold, please make sure that those chats are “on the record”.

Finally, remember that even when you’re “off the record”, your chat partner may be recording the conversation, so always take care with what you write. Thanks for your help and understanding on this. Let one of us know if you have any questions.

Bill Coughran

Kent Walker

See also required “Communicate with Care” training.

See also Antitrust Basics for Search Team (March 2011), recommending word choice for Google employees discussing competition matters.

Impact of GitHub Copilot on code quality

Jared Bauer summarizes results of a study I suggested this spring.  202 developers were randomly assigned to use GitHub Copilot, while the remaining participants were instructed not to use AI tools.  The participants were asked to complete a coding task.  Developers with GitHub Copilot had a 56% greater likelihood of passing all unit tests.  Separately, other developers evaluated the submitted code to assess its quality and readability.  Code from developers with GitHub Copilot was rated better on readability, maintainability, and conciseness.  All these differences were statistically significant.

The Effect of Microsoft Copilot in a Multi-lingual Context with Donald Ngwe

We tested Microsoft Copilot in multilingual contexts, examining how Copilot can facilitate collaboration between colleagues with different native languages.

First, we asked 77 native Japanese speakers to review a meeting recorded in English. Half the participants had to watch and listen to the video. The other half could use Copilot Meeting Recap, which gave them an AI meeting summary as well as a chatbot to answer questions about the meeting.

Then, we asked 83 other native Japanese speakers to review a similar meeting, following the same script, but this time held in Japanese by native Japanese speakers. Again, half of participants had access to Copilot.

For the meeting in English, participants with Copilot correctly answered 16.4% more multiple-choice questions about the meeting, and they were more than twice as likely to get a perfect score.  Moreover, comparing accuracy across the two scenarios, people listening to the English meeting with Copilot achieved 97.5% accuracy, slightly higher than people listening to a meeting in their native Japanese using standard tools (94.8%), a statistically significant difference (p<0.05). The changes are small in percentage-point terms because baseline accuracy is so high, but Copilot closed 38.5% of the gap to perfect accuracy for those working in their native language (p<0.10) and 84.6% of the gap for those working in (non-native) English (p<0.05).
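The “gap closed” figures may be easier to follow as explicit arithmetic. Below is a minimal sketch, assuming the metric is the share of the remaining distance to 100% accuracy that Copilot closes; the without-Copilot baselines in the example are back-solved from the reported percentages rather than stated above.

```python
def gap_closed(acc_without: float, acc_with: float) -> float:
    """Fraction of the remaining gap to 100% accuracy closed by Copilot."""
    return (acc_with - acc_without) / (100.0 - acc_without)

# English meeting: reported Copilot accuracy is 97.5; a no-Copilot
# baseline of ~83.8 (back-solved, not stated above) reproduces the
# reported 84.6% gap closure.
print(round(gap_closed(83.8, 97.5), 3))  # ~0.846

# Japanese meeting: reported no-Copilot baseline is 94.8; a Copilot
# accuracy of ~96.8 (back-solved) reproduces the reported 38.5%.
print(round(gap_closed(94.8, 96.8), 3))  # ~0.385
```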


Summary from Jaffe et al., Generative AI in Real-World Workplaces, July 2024.

Impact of M365 Copilot on Legal Work at Microsoft

Teams at Microsoft often reflect on how Copilot helps.  I try to help these teams both by measuring Copilot usage in the field (as they do their ordinary work) and by running lab experiments (idealized versions of their tasks in environments where I can better isolate cause and effect).  This month I ran an experiment with CELA, Microsoft’s in-house legal department.  Hossein Nowbar, Chief Legal Officer and Corporate Vice President, summarized the findings in a post at LinkedIn:

Recently, we ran a controlled experiment with Microsoft’s Office of the Chief Economist, and the results are groundbreaking. In this experiment, we asked legal professional volunteers on our team to complete three realistic legal tasks and randomly granted Copilot to some participants. Individuals with Copilot completed the tasks 32% faster and with 20.3% greater accuracy!

Copilot isn’t just a tool; it’s a game-changer, empowering our team to focus on what truly matters by enhancing productivity, elevating work quality, and, most importantly, reclaiming time.

All findings statistically significant at p<0.05.

Full results.

Early LLM-based Tools for Enterprise Information Workers Likely Provide Meaningful Boosts to Productivity

Early LLM-based Tools for Enterprise Information Workers Likely Provide Meaningful Boosts to Productivity. Microsoft Research Report – AI and Productivity Team. With Alexia Cambon, Brent Hecht, Donald Ngwe, Sonia Jaffe, Amy Heger, Mihaela Vorvoreanu, Sida Peng, Jake Hofman, Alex Farach, Margarita Bermejo-Cano, Eric Knudsen, James Bono, Hardik Sanghavi, Sofia Spatharioti, David Rothschild, Daniel G. Goldstein, Eirini Kalliamvakou, Peter Cihon, Mert Demirer, Michael Schwarz, and Jaime Teevan.

This report presents the initial findings of Microsoft’s research initiative on “AI and Productivity”, which seeks to measure and accelerate the productivity gains created by LLM-powered productivity tools like Microsoft’s Copilot. The many studies summarized in this report, the initiative’s first, focus on common enterprise information worker tasks for which LLMs are most likely to provide significant value. Results from the studies support the hypothesis that the first versions of Copilot tools substantially increase productivity on these tasks. This productivity boost usually appeared in the studies as a meaningful increase in speed of execution without a significant decrease in quality. Furthermore, we observed that the willingness-to-pay for LLM-based tools is higher for people who have used the tools than those who have not, suggesting that the tools provide value above initial expectations. The report also highlights future directions for the AI and Productivity initiative, including an emphasis on approaches that capture a wider range of tasks and roles.

Studies I led that are included within this report:

Randomized Controlled Trials for Microsoft Copilot for Security with James Bono, Sida Peng, Roberto Rodriguez, and Sandra Ho. Updated March 29, 2024.

Randomized Controlled Trials for Microsoft Copilot for Security. SSRN Working Paper 4648700. With James Bono, Sida Peng, Roberto Rodriguez, and Sandra Ho.

We conducted randomized controlled trials (RCTs) to measure the efficiency gains from using Security Copilot, including speed and quality improvements. External experimental subjects logged into an M365 Defender instance created for this experiment and performed four tasks: Incident Summarization, Script Analyzer, Incident Report, and Guided Response. We found that Security Copilot delivered large improvements in both speed and accuracy. Copilot brought improvements for both novices and security professionals.

(Also summarized in What Can Copilot’s Earliest Users Teach Us About Generative AI at Work? at “Role-specific pain points and opportunities: Security.” Also summarized in AI and Productivity Report at “M365 Defender Security Copilot study.”)

Sound Like Me: Findings from a Randomized Experiment with Donald Ngwe

Sound Like Me: Findings from a Randomized Experiment. SSRN Working Paper 4648689. With Donald Ngwe.

A new version of Copilot for Microsoft 365 includes a feature to let Outlook draft messages that “Sound Like Me” (SLM) based on training from messages in a user’s Sent Items folder. We sought to evaluate whether SLM lives up to its name. We find that it does, and more. Users widely and systematically praise SLM-generated messages as being more clear, more concise, and more “couldn’t have said it better myself”. When presented with a human-written message versus an SLM rewrite, users say they’d rather receive the SLM rewrite. All these findings are statistically significant. Furthermore, when presented with human and SLM messages, users struggle to tell the difference, in one specification doing worse than random.

(Also summarized in What Can Copilot’s Earliest Users Teach Us About Generative AI at Work? at “Email effectiveness.” Also summarized in AI and Productivity Report at “Outlook Email Study.”)

Measuring the Impact of AI on Information Worker Productivity with Donald Ngwe and Sida Peng

Measuring the Impact of AI on Information Worker Productivity. SSRN Working Paper 4648686. With Donald Ngwe and Sida Peng.

This paper reports the results of two randomized controlled trials evaluating the performance and user satisfaction of a new AI product in the context of common information worker tasks. We designed workplace scenarios to test common information worker tasks: retrieving information from files, emails, and calendar; catching up after a missed online meeting; and drafting prose. We assigned these tasks to 310 subjects, who were asked to find relevant information, answer multiple-choice questions about what they found, and write marketing content. In both studies, users with the AI tool were statistically significantly faster, a difference that holds both on its own and when controlling for accuracy/quality. Furthermore, users who tried the AI tool reported higher willingness to pay relative to users who merely heard about it but didn’t get to try it, indicating that the product exceeded expectations.

(Also summarized in What Can Copilot’s Earliest Users Teach Us About Generative AI at Work? at “A day in the life” and “The strain of searching.” Also summarized in AI and Productivity Report at “Copilot Common Tasks Study” and “Copilot Information Retrieval Study.”)

An Introduction to the Competition Law and Economics of “Free” with Damien Geradin

Benjamin Edelman and Damien Geradin. An Introduction to the Competition Law and Economics of ‘Free’.  Antitrust Chronicle, Competition Policy International.  August 2018.

Many of the largest and most successful businesses today rely on providing services at no charge to at least a portion of their users. Consider companies as diverse as Dropbox, Facebook, Google, LinkedIn, The Guardian, Wikipedia, and the Yellow Pages.

For consumers, it is easy to celebrate free service. At least in the short term, free services are often high quality, and users find a zero price virtually irresistible.

But long-term assessments could differ, particularly if the free service reduces quality and consumer choice. In this short paper, we examine these concerns.  Some highlights:

First, “free” service tends to be free only in terms of currency.  Consumers typically pay in other ways, such as seeing advertising and providing data, though these payments tend to be more difficult to measure.

Second, free service sometimes exacerbates market concentration.  Most notably, free service impedes a natural strategy for entrants: offering a similar product or service at a lower price.  Entrants usually can’t pay users to accept their service.  (That would tend to attract undesirable users who might even discard the product without trying it.)  As a result, prices are stuck at zero and entry becomes more difficult, effectively shielding incumbents from competition.

In this short paper, we examine the competition economics of “free” — how competition works in affected markets, what role competition policy might have and what approach it should take, and finally how competitors and prospective competitors can compete with “free.” Our bottom line: While free service has undeniable appeal for consumers, it can also impede competition, and especially entry. Competition authorities should be correspondingly attuned to allegations arising out of “free” service and should, at least, enforce existing doctrines strictly in affected markets.