How GitHub Copilot Thinks: Mental Models & Context
ByteStrike's Problem: Code Prediction Under Uncertainty
ByteStrike's decoder is working, but the output is messy, and the suggestions Copilot offers depend heavily on the comments and variable names surrounding the code. This part teaches why, so you can guide Copilot intentionally.
Understanding the Machine Behind the Suggestions
To use GitHub Copilot effectively, you need to understand how it "thinks": not in the sense that it has human-like consciousness, but in terms of how it processes information and generates suggestions. This chapter explores the mental models and context mechanisms that drive Copilot's recommendations.
When you understand these mechanisms, you'll write better prompts, anticipate where Copilot might go wrong, and know exactly how to steer it back on track.
Learning Objectives
- Understand token context and model limitations that shape Copilot's "awareness" of your code
- Learn how prompts influence suggestions through code, comments, and file context
- Identify why Copilot succeeds or fails in predicting your intent
- Master context engineering to guide Copilot toward better outcomes
- Recognize context leakage and how to prevent it
The Token Context Window
At its core, Copilot (and all modern language models) processes code as a sequence of tokens (roughly equivalent to words, but more granular). Each model has a context window: a maximum number of tokens it can "see" at once.
Think of the context window like a window into your codebase. Copilot can only see what's visible through that window. If your file is large or you're working in a repository with many dependencies, Copilot might not see the full picture.
Practical implications:
- Very long files (1000+ lines) can confuse Copilot because relevant context gets pushed out of the window
- Comments and variable names are crucial because they "explain" context succinctly
- Imports and type annotations help Copilot understand the frameworks and libraries you're using
- Function signatures (names, parameters, return types) are high-value context that guide suggestions
Example: Context Window Effect
Consider a large Python file with many utility functions. If you're at the bottom of the file trying to write a new function, Copilot might not have "seen" a helper function defined at the top. This can result in Copilot re-implementing logic that already exists elsewhere in the file.
Solution: Keep related functions close together, or use clear comments to summarize context. For instance:
# Helper: Uses the existing format_date() function from above
def generate_report(data):
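Assuming a hypothetical format_date() helper defined earlier in the file, the summarizing comment plus the completion it steers toward might look like this sketch:

```python
def format_date(date_str):
    # Hypothetical helper imagined near the top of the file
    year, month, day = date_str.split("-")
    return f"{day}/{month}/{year}"

# Helper: Uses the existing format_date() function from above
def generate_report(data):
    # The summarizing comment steers Copilot toward reusing format_date()
    return [f"{format_date(item['date'])}: {item['total']}" for item in data]
```

The comment matters even though the helper is only a few dozen lines away: it restates the relevant context right where the new function is being written.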
How Prompts Shape Suggestions
Copilot's suggestions are strongly influenced by what you provide as input. This input comes from three sources:
1. Code Context (Preceding and Following Code)
Copilot analyzes the code above the cursor (what you've already typed) and sometimes looks ahead to infer structure.
Example:
def process_user_data(user_id):
    user = fetch_user(user_id)
    # At this point, Copilot knows about 'user' and will suggest operations
    # relevant to the user object
2. Comments and Docstrings
Comments are gold for Copilot. Explicit instructions in comments dramatically improve suggestion quality.
Weak prompt:
def process(data):
    # Some processing here
Strong prompt:
def process(data):
    # Iterate through data list, filter out None values,
    # convert each item to uppercase, and return as a new list
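With that comment in place, a completion along these lines is typical (this implementation is a sketch of what such a suggestion usually looks like, not a recorded Copilot output):

```python
def process(data):
    # Iterate through data list, filter out None values,
    # convert each item to uppercase, and return as a new list
    return [item.upper() for item in data if item is not None]
```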
3. File Metadata (Language, Imports, Class/Function Signatures)
Copilot uses imports, class definitions, and function signatures as context clues about your coding style and the libraries you're using.
Example: If Copilot sees import numpy as np at the top of your file, it will suggest NumPy idioms rather than vanilla Python loops.
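A small sketch of the effect (the completions shown are illustrative, not recorded suggestions):

```python
import numpy as np

values = np.array([1.0, 2.0, 3.0])
# With numpy in scope, Copilot leans toward vectorized idioms like these
doubled = values * 2          # instead of [v * 2 for v in values]
total = values.sum()          # instead of an explicit accumulator loop
```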
Why Copilot Succeeds and Why It Fails
Copilot Succeeds When:
- Context is clear: Function names, variable names, and comments align well with what you're trying to do
- Patterns are common: The code follows standard idioms (e.g., iterating over a list, writing a CRUD operation)
- Context is local: All the information Copilot needs is visible within a few lines
- Examples exist in training data: The pattern is well-represented in public code repositories
Copilot Fails When:
- Context is ambiguous: Variable names are unclear; the purpose of a function is not self-evident
- Context is distant: Key information is many lines away, outside the context window
- Patterns are rare: The code is highly custom or domain-specific; few similar examples exist in training data
- Requirements are complex: Multi-step logic or edge cases require human reasoning
Context Engineering: Steering Copilot Right
Context engineering is the art of structuring your code and comments to guide Copilot toward better suggestions. Here are core techniques:
1. Use Descriptive Names
Bad: def f(x):
Good: def calculate_total_revenue(sales_data):
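The descriptive name alone pulls completions toward plausible logic. Here the shape of sales_data (a list of dicts with price and quantity fields) is an assumption made for illustration:

```python
def calculate_total_revenue(sales_data):
    # The kind of completion this name tends to elicit
    # (the price/quantity field names are assumed, not given)
    return sum(sale["price"] * sale["quantity"] for sale in sales_data)
```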
2. Add Type Annotations
from typing import List, Dict
def aggregate_user_metrics(user_ids: List[int]) -> Dict[int, float]:
    # Copilot now knows input and output types, improving suggestions
public Dictionary<int, double> AggregateUserMetrics(List<int> userIds)
{
// Type safety guides Copilot's reasoning
}
/**
* @param {number[]} userIds
* @returns {Map<number, number>}
*/
function aggregateUserMetrics(userIds) {
// JSDoc types guide Copilot toward Map lookups instead of ad-hoc objects
}
3. Provide Examples or Patterns
If you want Copilot to follow a specific pattern, show it an example first:
# Example: Convert list of integers to list of their string representations
numbers = [1, 2, 3]
string_numbers = [str(n) for n in numbers]
# Now apply this pattern to another list
user_ids = [101, 102, 103]
user_id_strings = # Copilot will suggest the list comprehension pattern
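Completed, the pattern-following suggestion typically mirrors the example line for line:

```python
# Example: Convert list of integers to list of their string representations
numbers = [1, 2, 3]
string_numbers = [str(n) for n in numbers]

# Copilot mirrors the comprehension established above
user_ids = [101, 102, 103]
user_id_strings = [str(uid) for uid in user_ids]
```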
4. Use Docstrings to Explain Intent
def validate_email(email: str) -> bool:
"""
Check if email is valid using standard rules:
- Must contain exactly one '@' symbol
- Must have non-empty local and domain parts
- Domain must contain at least one '.'
Returns:
True if valid, False otherwise
"""
5. Place Related Code Together
Group functions that work together. If Copilot sees related patterns nearby, it can reuse them.
# Utility functions for data parsing
def parse_csv_line(line: str) -> List[str]:
    ...
def parse_json_object(json_str: str) -> Dict:
    # Copilot is primed with parse patterns
    ...
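Filled in, the pair might read as follows; the CSV parser is a naive sketch that ignores quoted fields:

```python
import json
from typing import Dict, List

# Utility functions for data parsing
def parse_csv_line(line: str) -> List[str]:
    # Naive split; production code should use the csv module for quoting
    return [field.strip() for field in line.split(",")]

def parse_json_object(json_str: str) -> Dict:
    # Primed by the parse_* pattern above, json.loads is the natural completion
    return json.loads(json_str)
```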
When Context "Leaks" or Fails
Sometimes Copilot uses unexpected context, leading to incorrect suggestions. Here's how to diagnose and fix it:
Symptom: Copilot Suggests Code That References Objects You Haven't Defined
Cause: Copilot assumed a dependency or helper function based on naming patterns, but it doesn't exist.
Fix: Be explicit about available imports and be more specific in your comment. For instance, instead of a vague comment like "fetch data," write "fetch data using the requests library and json.loads()".
Symptom: Copilot Suggests the Wrong Pattern
Cause: Your code context is similar to a different pattern in Copilot's training data, and it's matching on surface similarity rather than true intent.
Fix: Add a more explicit comment or docstring that clarifies your intent. Show examples of what you expect.
Lab 2: ByteStrike's Context Experiments
Your task: Improve ByteStrike's decoder using context engineering, without changing the core logic.
Experiments:
- Variable naming: Rename x to secret in the decoder loop. Watch how Copilot's suggestions change.
- Docstrings: Add a detailed docstring to the extract_secrets() function. Observe how Copilot infers error handling.
- Comments as signal: Add inline comments like # Extract text between {* and *}. Suggest the next line to Copilot and note the quality.
- File structure: Create two versions of the decoder in the same workspace (e.g., decoder_v1.py and decoder_v2.py with better naming). Compare Copilot's output in each.
What you'll learn: Copilot isn't "guessing"; it's pattern-matching on context. Better context = better code. This lesson applies to everything ByteStrike writes from now on.
Task 1: Context Sensitivity
Create a file called context_experiment.py
Step 1: Write minimal context
def transform(data):
    # TODO: implement
Note Copilot's suggestion. Is it clear? Probably not very helpful.
Step 2: Add type hints and a better comment
def transform(data: List[Dict[str, Any]]) -> List[str]:
    # Extract the 'name' field from each dictionary and return as a list of strings
See how much more relevant Copilot's suggestion becomes.
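For reference, the completion that the typed signature usually elicits looks like:

```python
from typing import Any, Dict, List

def transform(data: List[Dict[str, Any]]) -> List[str]:
    # Extract the 'name' field from each dictionary
    return [item["name"] for item in data]
```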
Create a file called ContextExperiment.cs
Step 1: Write minimal context
public List<string> Process(List<object> data)
{
// TODO: implement
}
Note Copilot's suggestion. Is it clear? Probably not very helpful.
Step 2: Add better type information and descriptive names
public List<string> ExtractNames(List<Dictionary<string, object>> data)
{
// Extract the 'name' field from each dictionary and return as a list of strings
}
See how much more relevant Copilot's suggestion becomes.
Create a file called contextExperiment.js
Step 1: Write minimal context
function transform(data) {
// TODO: implement
}
Note Copilot's suggestion. Is it clear? Probably not very helpful.
Step 2: Add JSDoc and a clearer description
/**
* @param {{ name: string }[]} data
* @returns {string[]}
*/
function extractNames(data) {
// Extract the 'name' field from each object and return as a list of strings
}
See how much more relevant Copilot's suggestion becomes. The JSDoc types and better naming transform the quality of suggestions.
Task 2: Variable Name Context
Your goal is to see firsthand how variable and function names steer Copilot's suggestions. You'll write the same logic twice - once with vague names and once with descriptive names - and compare what Copilot offers each time.
Create a file called naming_experiment.py
Step 1: Write this function with vague names and place your cursor after the comment. Wait for Copilot's suggestion - note what it proposes.
def process(d):
    # return the result
Copilot has very little to work with here. What does it suggest? Is it useful or generic?
Step 2: Now write this version directly below it. Again, place your cursor after the comment and observe Copilot's suggestion.
def extract_product_names(products: List[Product]) -> List[str]:
    # iterate over products and return a list of each product's name
What to notice: Did Copilot suggest a list comprehension using product.name? Compare that to what it offered for process(d). The function name, parameter name, and return type together give Copilot everything it needs to suggest the right logic.
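For comparison, here is the comprehension that signature tends to produce; the Product class below is a minimal stand-in for whatever type your project actually defines:

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Product:
    # Minimal stand-in for the Product type assumed by the example
    name: str

def extract_product_names(products: List[Product]) -> List[str]:
    # The completion the descriptive signature tends to elicit
    return [product.name for product in products]
```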
Create a file called NamingExperiment.cs
Step 1: Write this method stub and place your cursor inside the body. Wait for Copilot's suggestion - note what it proposes.
public List<string> Process(List<object> d)
{
// return the result
}
Copilot has very little to work with here. What does it suggest? Is it useful or generic?
Step 2: Now write this version directly below it. Place your cursor inside the body and observe Copilot's suggestion.
public List<string> ExtractProductNames(List<Product> products)
{
// iterate over products and return a list of each product's name
}
What to notice: Did Copilot suggest a LINQ Select using p.Name? Compare that to what it offered for Process(d). The method name and typed parameter together give Copilot everything it needs to suggest the right implementation.
Create a file called namingExperiment.js
Step 1: Write this function and place your cursor after the comment. Wait for Copilot's suggestion - note what it proposes.
function process(d) {
// return the result
}
Copilot has very little to work with here. What does it suggest? Is it useful or generic?
Step 2: Now write this version directly below it. Place your cursor after the comment and observe Copilot's suggestion.
/**
* @param {{ name: string }[]} items
* @returns {string[]}
*/
function extractItemNames(items) {
// iterate over items and return a list of each item's name
}
What to notice: Did Copilot suggest items.map(item => item.name)? Compare that to what it offered for process(d). The descriptive function name, parameter name, and JSDoc types together give Copilot everything it needs to suggest the right logic.
Task 3: Example-Driven Prompting
Create a function that processes lists.
First, establish a pattern by writing an example:
def format_names(names: List[str]) -> List[str]:
# Example: ["alice", "bob"] -> ["ALICE", "BOB"]
return [name.upper() for name in names]
Then, hint that you want to reuse the pattern:
def format_dates(dates: List[str]) -> List[str]:
    # Using the same pattern as format_names above:
    # Example: ["2026-01-17", "2026-02-20"] -> ["Jan 17, 2026", "Feb 20, 2026"]
    # Copilot will follow the established pattern
Copilot will recognize the established pattern and apply similar logic to the new function.
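One plausible completion for format_dates, matching the comment's example output (the exact format string is an assumption about what Copilot would pick):

```python
from datetime import datetime
from typing import List

def format_dates(dates: List[str]) -> List[str]:
    # Parse ISO dates and re-render them in "Jan 17, 2026" style
    return [datetime.strptime(d, "%Y-%m-%d").strftime("%b %d, %Y") for d in dates]
```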
First, establish a pattern by writing an example:
public List<string> FormatNames(List<string> names)
{
// Example: ["alice", "bob"] -> ["ALICE", "BOB"]
return names.Select(n => n.ToUpper()).ToList();
}
Then, hint that you want to reuse the pattern:
public List<string> FormatDates(List<string> dates)
{
// Using the same pattern as FormatNames above:
// Example: ["2026-01-17", "2026-02-20"] -> ["Jan 17, 2026", "Feb 20, 2026"]
// Copilot will follow the established pattern
}
Copilot will recognize the established pattern and apply similar logic to the new function.
First, establish a pattern by writing an example:
// Example: ["alice", "bob"] -> ["ALICE", "BOB"]
const formatNames = (names) => names.map((name) => name.toUpperCase());
Then, hint that you want to reuse the pattern:
// Using the same pattern as formatNames above:
// Example: ["2026-01-17", "2026-02-20"] -> ["Jan 17, 2026", "Feb 20, 2026"]
const formatDates = (dates) => {
// Copilot will suggest a date formatting approach in the same style
};
Copilot will recognize the established pattern and apply similar logic to the new function.
Task 4: Docstring Impact
Compare two implementations:
Step 1 - Without docstring:
def calculate(a, b, c):
Note what Copilot suggests. It might return a + b + c, or (a + b) * c, or just pass - something generic, because the parameter names carry no semantic meaning.
Step 2 - With detailed docstring:
def calculate(a: float, b: float, c: float) -> float:
"""
Calculate the area of a triangle given three side lengths using Heron's formula.
Args:
a: Length of side A
b: Length of side B
c: Length of side C
Returns:
The area of the triangle
"""
Copilot should now suggest math-based logic along the lines of Heron's formula. Notice the difference in quality.
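The implementation Copilot typically lands on, for reference:

```python
import math

def calculate(a: float, b: float, c: float) -> float:
    # Heron's formula: area = sqrt(s * (s - a) * (s - b) * (s - c))
    s = (a + b + c) / 2  # semi-perimeter
    return math.sqrt(s * (s - a) * (s - b) * (s - c))
```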
Important nuance: The docstring is a major steer, not an absolute override. Copilot weighs all available signals together - function name, parameter names and types, surrounding code patterns, and the docstring. Here, the docstring names the algorithm explicitly, which is a very strong signal. But if the function were named sum_sides and the docstring mentioned Heron's formula, those conflicting signals would produce a less predictable result.
Think of the docstring as the loudest voice in the room - it will usually win, but the other signals are still in the conversation. The more your name, types, and docstring all point in the same direction, the more confidently Copilot will follow.
Step 1 - Without documentation:
public double Calculate(double a, double b, double c)
Note what Copilot suggests - likely a generic return a + b + c; or a throw new NotImplementedException(); stub.
Step 2 - With detailed XML documentation:
/// <summary>
/// Calculate the area of a triangle given three side lengths using Heron's formula.
/// </summary>
/// <param name="a">Length of side A</param>
/// <param name="b">Length of side B</param>
/// <param name="c">Length of side C</param>
/// <returns>The area of the triangle</returns>
public double Calculate(double a, double b, double c)
Copilot should now suggest a Math.Sqrt-based implementation. Compare to what it offered before.
Important nuance: The XML doc is a major steer, not an absolute override. Copilot weighs the method name, parameter types, surrounding code, and the documentation together. Naming the algorithm explicitly in <summary> is one of the strongest signals you can give - but it works best when the method name is also consistent. Calculate is neutral, so the docstring dominates. If the method were named SumSides, the conflicting signal would make the result less predictable.
The practical takeaway: align your method name, parameter names, and documentation to point at the same intent. When they agree, Copilot's confidence goes up and its suggestions become much more accurate.
Step 1 - Without documentation:
function calculate(a, b, c) {
// Copilot has little context; suggestions will be generic
}
Note what Copilot suggests - likely return a + b + c; or similar.
Step 2 - With detailed JSDoc:
/**
* Calculate the area of a triangle given three side lengths using Heron's formula.
* @param {number} a Length of side A
* @param {number} b Length of side B
* @param {number} c Length of side C
* @returns {number} The area of the triangle
*/
function calculate(a, b, c) {
}
Copilot should now suggest a Math.sqrt-based Heron's formula implementation. Compare to the generic version.
Important nuance: The JSDoc is a major steer, not an absolute override. Copilot weighs the function name, parameter names, JSDoc types, and surrounding code all together. Here, naming the algorithm in the doc comment is the dominant signal - but it works best when aligned with the other signals. If the function were named sumSides, the contradictory name would dilute the docstring's influence.
The strongest results come when all signals point the same direction: a descriptive function name, typed parameters, and a precise docstring that names the specific algorithm or behaviour you want. Docstrings amplify your intent - they don't replace the need for consistent context across the whole signature.
Reflection Questions
- How did adding type hints improve Copilot's suggestions?
- Did more detailed comments lead to more accurate code?
- When did Copilot fail or surprise you?
- How can you apply context engineering to your own projects?
Conclusion
Copilot is a statistical model that excels at pattern matching. By understanding how it uses context (from token windows to naming conventions), you can engineer prompts that guide it toward excellent suggestions. The key is to think like Copilot: provide clear context, use descriptive names, add type information, and write comments that explain your intent in simple, concrete terms.
In the next chapter, we'll apply these mental models to real-world pair programming scenarios.