Microsoft · Copilot in Word Web

Designing AI as a learning partner

College students are using generative AI to get things done faster, but faster isn't always better when learning is the goal. I worked with a Microsoft team to redesign the Copilot experience in Word Web to support students' thinking, not replace it. The result: Word Board, a Copilot-assisted brainstorming space embedded directly in the document workflow.

Role: Product Designer
Timeline: 2025-2026
Scope: GenAI, Human-AI Interaction, EdTech
01 Context

Generative AI has fundamentally changed how students learn, or avoid learning.

Students aren't going to stop using AI. The question becomes whether it makes them better thinkers or just faster ones.

NPR/KQED: The risks of AI in schools
USC Today: AI is changing how students learn
CNN: Is AI schooling the future of education?

The statistics

Our research confirmed what the headlines were reporting, and gave us the scale and specificity we needed to design for it.

National survey · 2024

90% of US college students use AI in their current academic workflow.

Our survey · n=72

93% of respondents used genAI at least once in the past week.

Customer feedback · n=1,679

1 in 3 students feel confident about how AI actually uses their data.

02 The Problem

AI is making students more productive, at the cost of their own thinking.

Generative AI tools have become deeply embedded in academic workflows. But generating finished content on demand bypasses the cognitive struggle that actually leads to learning. The Copilot experience, like other generative AI tools, was optimized for speed, not understanding.

What students were struggling with

Trust & accuracy

Couldn't verify whether outputs were accurate or hallucinated

Ownership & voice

Felt their ideas were being replaced, not supported

Workflow fit

Copilot didn't fit naturally into how students actually write

How might we reframe the Copilot AI experience in Word Web as a complementary assistant that supports college students' learning?
03 The Solution
Introducing
Word Board

Word Board lives alongside the Word document, giving students a canvas to brainstorm, connect ideas, and use Copilot as a thinking partner without losing sight of their own work.

Feature Walkthrough
01 / 12

Entering Word Board & Adding a Card

Students open Word Board directly from within Word Web. Copilot generates an initial idea card from the document context. Copilot-authored cards are visually distinct from student-authored ones from the very start.

02 / 12

Uploading Files

Students can upload source material (PDFs, documents, web links) directly to the Board. Copilot reads the uploaded content and generates idea cards from it, turning source material into brainstorming fuel.

03 / 12

Different Types of Cards

The Board supports multiple card types: text cards (student-authored ideas), link cards (web sources), file cards (uploaded documents), and Copilot cards (AI-generated). Each has a distinct visual treatment so the source of every idea is always clear.
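
As a rough illustration of how the four card types could be modeled so that the source of every idea stays explicit, here is a minimal Python sketch. It is not the Word Board implementation: `CardType`, `Card`, `is_ai_generated`, and the `card--…` style tokens are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass
from enum import Enum


class CardType(Enum):
    """The four card types described above (names are illustrative)."""
    TEXT = "text"        # student-authored idea
    LINK = "link"        # web source
    FILE = "file"        # uploaded document
    COPILOT = "copilot"  # AI-generated


@dataclass(frozen=True)
class Card:
    card_id: str
    card_type: CardType
    content: str

    @property
    def is_ai_generated(self) -> bool:
        # In this sketch only Copilot cards count as AI-authored.
        return self.card_type is CardType.COPILOT


def visual_treatment(card: Card) -> str:
    """Return a hypothetical style token; each card type gets its own."""
    return f"card--{card.card_type.value}"
```

Deriving the visual treatment from the card type, rather than storing it separately, means a card can never be styled as student-authored while being tagged as Copilot-generated.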

04 / 12

Creating New Connected Cards

Students can manually draw connections between cards to build their own argument structure. Connecting two cards signals a relationship; the student defines what the connection means and how the ideas relate.

05 / 12

Creating New Connected Cards with Copilot

Students can ask Copilot to generate a new card connected to an existing one, expanding a single idea into a cluster of related thoughts. Copilot suggests; the student decides which connections to keep.

06 / 12

Regenerating Cards with Copilot

If a Copilot-generated card doesn't quite fit, students can regenerate it. Instead of replacing the original, regeneration spawns a variation alongside it, so nothing is lost and students can compare options.

07 / 12

Generating Document Text from Cards & Tracing Card Origin

When a student is ready to write, they can select cards on the Board and ask Copilot to generate Word document prose from them. The writing is grounded in the student's own ideas, making the output feel earned, not outsourced.

08 / 12

Tracing Copilot Text to Card Origin

Students can click any Copilot-generated sentence in the document and trace it directly back to the board card it came from. This audit trail supports academic integrity and helps students understand exactly how AI contributed.
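
One way to picture the audit trail is as a lookup from each generated sentence to the cards it came from. The sketch below is an assumption-heavy illustration, not the shipped design: `ProvenanceIndex`, `record`, `trace`, and the sentence and card ids are hypothetical, and it allows several source cards per sentence because prose is generated from a selection of cards.

```python
from dataclasses import dataclass, field


@dataclass
class ProvenanceIndex:
    """Maps a generated sentence to the Board cards it was written from."""
    # sentence_id -> ids of the source cards (hypothetical identifiers)
    _sources: dict[str, list[str]] = field(default_factory=dict)

    def record(self, sentence_id: str, card_ids: list[str]) -> None:
        # Called when Copilot generates a sentence from the selected cards.
        self._sources[sentence_id] = list(card_ids)

    def trace(self, sentence_id: str) -> list[str]:
        # Student-written sentences have no entry, so they trace to nothing.
        return list(self._sources.get(sentence_id, []))


index = ProvenanceIndex()
index.record("sentence-1", ["card-a", "card-b"])
origin = index.trace("sentence-1")       # cards behind a Copilot sentence
untracked = index.trace("sentence-99")   # a sentence the student wrote
```

Returning an empty list for untracked sentences keeps the distinction between AI-generated and student-authored text visible in the data itself.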

09 / 12

Connecting & Grouping Cards with Copilot

The relationship is bidirectional. From within the document, students can ask Copilot to generate Board cards from selected text, turning what they've already written into raw material for further brainstorming.

10 / 12

Copilot Reasoning

Students can expand any Copilot card to see the reasoning behind it: why Copilot made that connection and what context it drew on. Making AI thinking visible reduces blind trust and helps students critically evaluate each suggestion.

11 / 12

Prompt History

Every Copilot action on the Board is logged in a prompt history panel: what was asked, what was generated, and when. Students can revisit past prompts to understand how their session evolved and replay any step.

12 / 12

Responsive Screens

Word Board adapts across screen sizes. On smaller screens, the Board collapses into a focused single-column view, and the document/board toggle remains accessible, ensuring the core workflow holds regardless of device.

04 Process

How did I get here?

We knew we wanted to explore the role of generative AI because it has become deeply embedded in how students learn, research, and write. As graduate students ourselves, we were also interested in understanding how these tools were affecting the academic experience beyond simply making tasks faster.

01
Mixed-Methods Research

Understanding how students actually think with AI

We ran four research methods in parallel: an NLP analysis of customer feedback (1,679 public Copilot entries), a competitive audit of 6 AI writing products, a 72-response quantitative survey, and 12 semi-structured interviews of 45 minutes each. The breadth was intentional: we wanted to triangulate across what students say, what they do, and where existing tools succeed and fail.

Product · Feedback Analysis · 1,679 public entries
Product · Competitive Audit · 6 AI writing products
User · Survey · 72 cleaned responses
User · Interviews · 12 participants · 45 min

Three themes from 12 interviews & ~600 affinity map sticky notes

Theme 01
Time Pressure

Under deadline, students bypass their own thinking entirely and turn to AI to get things done fast, skipping the cognitive effort that leads to learning.

"I have a paper due in three hours. I just need to get something down. I'll think about it later."

Theme 02
Confidence & Expertise

Students rely on genAI most when they feel least confident in their own skills, especially for writing. This creates a cycle where AI replaces skill-building rather than scaffolding it.

"I second-guess everything I write. At least when Copilot says it, it sounds like it knows what it's talking about."

Theme 03
Ownership & Control

Despite heavy AI use, students consistently wanted to preserve their own ideas, voice, and contribution. When the final product no longer felt "theirs," it created real discomfort.

"I want the idea to come from me. AI can help me get it out, but the thought has to be mine."

Quantitative findings

Survey · n=72

Brainstorming is the #1 use case for genAI

82% of respondents use genAI to brainstorm before writing a single word, the most common use we measured. That moment, when students open a blank doc and immediately reach for AI, was the gap we were designing to close.

Survey · n=72 · confidence

54% seek AI validation to feel confident in their work

Over half of respondents said they seek AI validation to feel confident about their decisions, not for efficiency, but for reassurance. This drove D-01 (Visibility) and D-03 (User-led Evaluation) as priority requirements.

Customer Feedback NLP · n=1,679

Public reviews revealed three recurring pain points

NLP analysis of public Copilot feedback confirmed the same tensions at scale: benchmark comparisons, privacy distrust, and unmet expectations. Together they point to a product that hadn't yet earned student trust.

GenAI usage in coursework (n=72)
Brainstorming ideas · 82%
Explaining concepts · 74%
Editing & proofreading · 68%
Drafting written content · 56%
Generating outlines · 51%
Survey · n=72
"I seek AI validation to feel confident about my decisions"
54% · Agree or strongly agree
25% · Disagree or strongly disagree
Strongly Agree · 24
Agree · 15
Neither · 15
Disagree · 9
Strongly Disagree · 9
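
The 54% and 25% headline figures follow directly from the response counts above. A small Python sketch of the arithmetic, rounding to whole percentages (the `counts` dictionary and the `share` helper are names introduced only for this example):

```python
# Response counts for "I seek AI validation to feel confident about my decisions".
counts = {
    "Strongly agree": 24,
    "Agree": 15,
    "Neither": 15,
    "Disagree": 9,
    "Strongly disagree": 9,
}

total = sum(counts.values())  # 72 cleaned survey responses


def share(*labels: str) -> int:
    """Combined share of the given answer options, as a rounded percentage."""
    return round(100 * sum(counts[label] for label in labels) / total)


agree_pct = share("Strongly agree", "Agree")            # 39 of 72 -> 54
disagree_pct = share("Disagree", "Strongly disagree")   # 18 of 72 -> 25
neutral_pct = share("Neither")                          # 15 of 72 -> 21
```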
Top themes · public Copilot reviews · NLP · n=1,679
Comparison to Competitors · 249 occurrences

Users benchmark Copilot against ChatGPT, Gemini, and others constantly

Privacy & Transparency · 93 occurrences

Users express strong distrust about how Copilot handles their data

Trust & Expectations · 66 occurrences

Mismatch between what users expect and what Copilot actually delivers

02
Synthesis · Affinity Mapping · Journey Mapping

Turning research into design direction

We mapped findings from all four research methods into an affinity map (~600 sticky notes), then identified clusters, translated clusters into user needs, and translated user needs into four design requirements. Two user groups emerged: The Learner (less confident, high risk of AI dependency) and The Expert (confident, uses AI tactically for efficiency).

A journey map across five writing phases (Brainstorming, Synthesis, Outline, Drafting, Refining) showed where AI was displacing student cognition rather than supporting it. The brainstorming phase was the biggest gap: students were reaching for AI before they'd even formed an opinion of their own.

D-01
Visibility
Visibility into Copilot's accuracy and thinking, so users can verify content throughout their workflow.
D-02
Differentiation
Clear distinction between AI-generated and user-authored content, preserving a sense of ownership.
D-03
User-led Evaluation
Avenues for students to apply their own subject-matter knowledge when evaluating AI output.
D-04
Complementary Workflow
An experience that fits naturally into how students write, from brainstorming through drafting.
Persona 01 · The Expert (primary persona, subject-matter expert student)
Persona 02 · The Learner (secondary persona)

We designed primarily for The Expert, a student with genuine subject-matter knowledge who risks having their expertise displaced by AI rather than supported by it.

Journey map across five student writing phases · where AI was displacing cognition
03
Ideation · Concept Testing · Design Crit

From 123 ideas to 4 concepts

With the brainstorming gap clearly defined, we ran several ideation sessions using Crazy 8s, SCAMPER, and user story exercises to generate a wide range of directions. We then clustered ideas around our four design requirements and pressure-tested them against edge cases before narrowing to four concepts worth developing.

Crazy 8s
Crazy 8 ideation sketches
SCAMPER
SCAMPER ideation exercise
User Stories
User story mapping

4 concepts that best addressed the problem space

Each concept addressed the core tension, supporting student thinking without replacing it, but approached it from a different angle.

Concept A
AI Reflection Tool

Shows AI contributions to the paper broken down by category, helping students reflect on their reliance on genAI and maintain ownership of their writing process.

Concept B
Argument Refinement Tool

Highlights claims, evidence, and reasoning in the document, as well as weak points, helping students strengthen their arguments through rubric-aligned feedback and source discovery.

Concept C
Work Repository

A consolidated space of instructor and student-uploaded sources and AI guidelines that a student can access at any point in their writing process, grounding AI use in course-specific context.

Concept D · Selected
Brainstorming Map

A visual workspace where students turn messy ideas into structured arguments by connecting thoughts, sources, and emerging themes before drafting. Keeps Copilot in a supporting, not directing, role.


Concept testing results · Students + 18 Microsoft designers

The Brainstorming Map received the most interest; students said it aligned with how they actually brainstorm. Users appreciated that it built on the Copilot ecosystem to create one consolidated place for scattered thoughts, notes, and sources, which streamlined their workflow. They also wanted the ability to inquire further into AI-generated content, which directly shaped the Reasoning and Prompt History features.

Other concepts received positive feedback but required significant professor involvement to implement and, most importantly, risked displacing instructor expertise rather than complementing it. The Board was the only concept that kept the intellectual work squarely with the student.

04
Qualitative Usability Testing

Testing with real students under real conditions

A usability study with 6 college students (60 minutes, remote) used the Single Ease Question scale (1 = Very Difficult · 7 = Very Easy) to score each task. Core Board interactions scored well (6.00 to 6.83). Opening the Board and regenerating cards revealed meaningful friction and became the focus of the next iteration. Notably, the low scores on early interactions didn't surface in our Microsoft design crit; it took real students, under task pressure, to find them.

SEQ Task Difficulty Scores · 1 = Very Difficult · 7 = Very Easy
Opening Board & creating an initial card · 4.33
Creating and regenerating new cards · 3.50
Generating & tracing Doc text from cards · 6.00
Generating cards from the Word Document · 6.83
Connecting ideas with AI · 6.83
Legend: Needs work (1–5) · Acceptable (5–6) · Good (6–7)
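
To make the banding explicit, here is a short Python sketch that sorts each task's mean SEQ score into the legend's bands. The legend's ranges share their endpoints, so this sketch assumes a boundary score belongs to the higher band (5.00 counts as Acceptable, 6.00 as Good). That rule and the `band` helper are assumptions made for illustration.

```python
# Mean SEQ score per task (1 = Very Difficult, 7 = Very Easy).
seq_scores = {
    "Opening Board & creating an initial card": 4.33,
    "Creating and regenerating new cards": 3.50,
    "Generating & tracing Doc text from cards": 6.00,
    "Generating cards from the Word Document": 6.83,
    "Connecting ideas with AI": 6.83,
}


def band(score: float) -> str:
    """Map a mean SEQ score to a legend band.

    Assumption: a score on a boundary falls into the higher band.
    """
    if not 1 <= score <= 7:
        raise ValueError(f"SEQ scores range from 1 to 7, got {score}")
    if score >= 6:
        return "Good"
    if score >= 5:
        return "Acceptable"
    return "Needs work"


bands = {task: band(score) for task, score in seq_scores.items()}
needs_work = [task for task, label in bands.items() if label == "Needs work"]
```

Under that assumption, the two tasks flagged as needing work are the same two that became the focus of the next iteration.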
What went well

Board ↔ Doc relationship

Students immediately understood how cards connected to document text and back.

What went well

AI vs. manual distinction

Visual card differences between Copilot and student content were recognized immediately.

Iteration needed

Reasoning discoverability

The reasoning dropdown was rarely found; users expected it near the prompt itself, not in a panel.

05 Outcome
Presented to Microsoft Copilot Product Team

A version of Copilot that makes students better thinkers, not faster typists.

Word Board was validated through rounds of expert critique and usability testing before being presented to the Microsoft Copilot product team. The Board-to-Document tracing and visual AI differentiation were the features most likely to carry forward in future explorations.

Students who interacted with Word Board described feeling more confident in their work, not because AI did it for them, but because it helped them trust their own process. The most important finding wasn't a metric; it was that the experience preserved the feeling of authorship.