Prompt Engineering : Parameters, Techniques, and Examples

As I continue documenting my daily learning in Generative AI, today’s focus was on prompt engineering—and it became clear very quickly that prompting is not guesswork or experimentation.

From a QA perspective, prompts are test inputs.
Poorly designed prompts behave like flaky tests.
Well‑designed prompts produce consistent, verifiable, and explainable outputs.

This post explains:

how prompts are structured
how inference parameters control behavior
how negative prompts and constraints reduce risk
how zero‑shot, few‑shot, and chain‑of‑thought prompting apply to real QA tasks

What Is a Prompt?

A prompt is the input instruction that defines how a generative model should behave.

A good prompt answers:

What should the model do?
What context should it use?
What must it avoid?
How should the output look?

Prompts are not questions — they are instructions.

Core Elements of an Effective Prompt

A well‑designed prompt typically includes:

Role or perspective
Task definition
Context
Constraints and exclusions
Output format

QA‑Style Example

“You are a QA engineer. Analyze the test failure details below and summarize the issue focusing on impact and reproducibility. Do not assume missing information. Output the response as bullet points.”

This mirrors how QA writes clear acceptance criteria.

Negative Prompts: Guardrails for AI

Negative prompts explicitly define what the model must NOT do.
They are critical for reducing hallucinations and prevent the model from producing hate speech, explicit content, or biased language

Example

“Generate a summary of this defect. Do NOT invent root causes. Do NOT introduce features not mentioned. If information is insufficient, state that clearly.”

From a QA mindset, this enforces honest reporting.

Prompt engineering is iterative:

tighten wording
add constraints
remove ambiguity
enforce format

QA engineers already do this when refining:

test cases
bug templates
automation assertions

Prompt iteration follows the same discipline.

Inference Parameters: Controlling AI Behavior

Inference parameters control how the model responds — not what it knows.

Randomness and Diversity Controls

🔹 Temperature

Low → deterministic and repeatable
High → creative and variable

QA workflows prefer low temperature for consistency.

🔹 Top‑P (Nucleus Sampling)

Controls how much probability mass is used for token selection.

Lower → focused outputs
Higher → more variety

🔹 Top‑K

Limits token selection to the top K choices.

Smaller K → predictability
Larger K → richer language

Combine low temperature + reasonable Top‑P/K to reduce flakiness.

Length Controls

🔹 Maximum Tokens

Prevents over‑generation.

🔹 Stop Sequences

Stops output at defined markers.

Essential for structured QA outputs.

Best Practices for Prompting (QA Lens)

Be explicit, not conversational
Define constraints clearly
Always specify output format
Use negative prompts for safety
Control randomness for repeatability
Validate outputs against expectations

Prompt Engineering Techniques with QA Scenarios

1. Zero‑Shot Prompting

The model is given instructions only, no examples.

Example

“Classify this defect as Functional, UI, Performance, or Security.”

Works well for:

simple triage
fast classification

Less reliable for ambiguous cases.

2. Few‑Shot Prompting

The model is shown examples before the task.

Example

“Example: ‘Page load exceeds 10 seconds’ → Performance ‘Login button overlaps text’ → UI

Now classify: ‘Search results misaligned on mobile’”

Far more consistent and production‑ready.

QA parallel: data‑driven testing.

3. Chain‑of‑Thought Prompting

The model is asked to reason step by step.

Example

“Analyze the test failure step by step. Explain possible causes and identify what data is missing.”

Ideal for:

defect analysis
flaky test investigation
root‑cause discussions

QA benefit: transparent reasoning, not just output.

Prompt Regression Testing (Often Overlooked)

Prompt changes are code changes.

Best practices:

version prompts
store baseline outputs
compare outputs after edits
validate no regression in accuracy or coverage

Treat prompts like test scripts.

Prompt Engineering in Practice: Real QA Scenarios That Actually Work

Below are practical prompt‑engineering scenarios that QA engineers actually face.

Scenario 1: Generating Test Scenarios from Requirements

Weak Prompt

Generate test cases for login functionality.

Issues:

Generic
No structure
No coverage expectations
Results vary on each run

QA‑Optimized Prompt:

You are a QA engineer. Generate test scenarios for login functionality.

Context:

User logs in with email and password
Invalid credentials show an error
Account locks after 5 failed attempts

Constraints:

Include happy path, negative, and edge cases
Do not invent features not described

Output format: Return results as a table with: Test Scenario | Preconditions | Steps | Expected Result

Why this works:

Clear role
Explicit context
Defined constraints
Verifiable output structure

QA takeaway: This is equivalent to writing clear acceptance criteria before automation.

Scenario 2: Using Negative Prompts to Prevent Hallucinations

Without Negative Prompts

Analyze this bug report and explain the root cause.

Risk:

AI may invent modules, services, or dependencies

With Negative Prompts

Analyze the provided bug report and explain the root cause.

Do NOT:

Invent root causes not mentioned
Introduce new features
Assume missing logs or data

If information is insufficient, clearly state that the root cause cannot be determined.

Why this works:

Forces honesty
Prevents false confidence
Matches QA reporting standards

QA mindset: “If data is missing, say it’s missing.”

Scenario 3: Controlling Randomness for Consistent QA Output Problem

Different outputs for the same prompt make validation difficult.

This is where inference parameters matter.

QA‑Friendly Inference Settings

Temperature: 0.2
Top‑P: 0.9
Top‑K: 40
Max Tokens: Controlled (e.g., 300)
Stop Sequence: ---END---

Result:

Stable responses
Reduced variance
Repeatable validation

QA principle: Determinism > creativity for testing workflows.

Scenario 4: Zero‑Shot Prompting for Fast Classification

Use Case: Classifying Defects

Prompt

Classify the following defect into one category: Functional, Performance, UI, or Security.

Defect: "Application crashes when searching with special characters."

When to use:

Simple classification
Early triage
Low‑risk decisions

Limitation:

Less accurate for ambiguous cases

Scenario 5: Few‑Shot Prompting for Higher Accuracy

Prompt

Classify defects into Functional, Performance, UI, or Security.

Examples:

"Page load takes more than 10 seconds" → Performance

"Button overlap on mobile" → UI

"Unauthorized access to admin page" → Security

Now classify:

"Search returns incorrect results for valid filters"

Why QA prefers this:

Reduces ambiguity
Produces consistent outputs
Suitable for production tools

This mirrors data‑driven testing in automation.

Scenario 6: Chain‑of‑Thought Prompting for Debugging Failures

Use Case: Analyzing a flaky test failure

Prompt

Analyze the following test failure step by step.

Explain:

What failed
Possible causes
Which causes are most likely
What additional data is needed to confirm

Do not jump to conclusions.

Best for:

Incident analysis Intermittent failures Root cause discussions

QA advantage: Transparent reasoning, not just an answer.

Scenario 7: Prompt Regression Testing (Often Missed)

QA often forgets this: Prompt changes are code changes.

Recommended practice:

Version prompts
Store baseline outputs
Compare outputs after prompt updates
Validate no regression in coverage or accuracy

Treat prompts like:

Test cases
Config files
Business rules

QA’s Role in Prompt Engineering

QA engineers bring structure by:

validating prompt clarity
testing prompt edge cases
controlling output variability
detecting hallucinations
enforcing reproducibility

In short:

Prompt engineering is quality engineering for AI.

Final Thoughts

Today’s learning reinforced an important truth for me as an SDET:

AI systems don’t fail silently — they fail due to ambiguous, untested inputs.

Prompt engineering applies the same discipline QA engineers already practice: clarity, structure, constraints, and validation.

Good prompts don’t happen by accident — they’re engineered.

— Hema

Command Palette

What Is a Prompt?

Core Elements of an Effective Prompt

QA‑Style Example

Negative Prompts: Guardrails for AI

Example

Prompt Iteration and Refinement

Inference Parameters: Controlling AI Behavior

Randomness and Diversity Controls

🔹 Temperature

🔹 Top‑P (Nucleus Sampling)

🔹 Top‑K

Length Controls

🔹 Maximum Tokens

🔹 Stop Sequences

Best Practices for Prompting (QA Lens)

Prompt Engineering Techniques with QA Scenarios

1. Zero‑Shot Prompting

Example

2. Few‑Shot Prompting

Example

3. Chain‑of‑Thought Prompting

Example

Prompt Regression Testing (Often Overlooked)

Prompt Engineering in Practice: Real QA Scenarios That Actually Work

Scenario 1: Generating Test Scenarios from Requirements

Scenario 2: Using Negative Prompts to Prevent Hallucinations

Scenario 3: Controlling Randomness for Consistent QA Output Problem

Scenario 4: Zero‑Shot Prompting for Fast Classification

Scenario 5: Few‑Shot Prompting for Higher Accuracy

Scenario 6: Chain‑of‑Thought Prompting for Debugging Failures

Scenario 7: Prompt Regression Testing (Often Missed)

QA’s Role in Prompt Engineering

Final Thoughts

Comments

AI for QA

When Prompts Go Wrong: Hidden Risks in AI Every QA Engineer Must Know

More from this blog