Lush vs Bash Benchmark Report

Model: claude-sonnet-4-20250514 · Latest run: 20260401T183152Z · Tasks: 18

Summary

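| Task | Category | Bash | Bash Turns | Lush | Lush Turns |
|---|---|---|---|---|---|
| fizzbuzz | algorithm | PASS | 1 | PASS | 1 |
| reverse_string | algorithm | PASS | 1 | PASS | 1 |
| two_sum | algorithm | PASS | 1 | PASS | 1 |
| env_config | environment | FAIL | 4 | PASS | 2 |
| env_path_builder | environment | PASS | 0 | PASS | 1 |
| path_normalizer | environment | PASS | 0 | PASS | 1 |
| file_organizer | filesystem | FAIL | 4 | PASS | 1 |
| multi_file_search | filesystem | PASS | 1 | PASS | 2 |
| todo_manager | filesystem | PASS | 0 | PASS | 1 |
| csv_transform | pipeline | PASS | 0 | PASS | 1 |
| currency_converter | pipeline | PASS | 0 | PASS | 1 |
| locale_weather_url | pipeline | PASS | 0 | PASS | 1 |
| log_parser | pipeline | PASS | 0 | PASS | 1 |
| network_info_parser | pipeline | PASS | 0 | PASS | 1 |
| pipeline_transform | pipeline | PASS | 1 | PASS | 1 |
| pipeline_word_freq | pipeline | PASS | 0 | PASS | 1 |
| url_normalizer | pipeline | PASS | 0 | PASS | 1 |
| process_exit_codes | process | PASS | 4 | PASS | 1 |
| Total | | 16/18 | | 18/18 | |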

Per-Category Summary

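| Category | Bash Pass | Lush Pass | Bash Avg Turns | Lush Avg Turns | Bash Avg Score | Lush Avg Score |
|---|---|---|---|---|---|---|
| algorithm | 3/3 | 3/3 | 1.0 | 1.0 | 3.5 | 3.9 |
| environment | 2/3 | 3/3 | 4.0 | 1.3 | 2.8 | 3.9 |
| filesystem | 2/3 | 3/3 | 2.5 | 1.3 | 3.1 | 3.8 |
| pipeline | 8/8 | 8/8 | 1.0 | 1.0 | 3.0 | 3.9 |
| process | 1/1 | 1/1 | 4.0 | 1.0 | 3.2 | 4.0 |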
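
The report does not say how the average-turn columns are computed. The published values are consistent with averaging only the non-zero turn counts in the Summary table, and every zero Bash turn count belongs to a convert-mode task. The sketch below reproduces the environment and filesystem rows under that assumption. The exclusion rule and the names `TURNS`, `avg_turns` and `category_turns` are illustrative, not taken from the benchmark harness.

```python
from statistics import mean

# (task, bash_turns, lush_turns), copied from the Summary table.
TURNS = {
    "environment": [
        ("env_config", 4, 2),
        ("env_path_builder", 0, 1),
        ("path_normalizer", 0, 1),
    ],
    "filesystem": [
        ("file_organizer", 4, 1),
        ("multi_file_search", 1, 2),
        ("todo_manager", 0, 1),
    ],
}


def avg_turns(values):
    """Mean of the non-zero turn counts, rounded to one decimal.

    Assumption: a count of 0 means no agent turns were recorded for that
    task, so it is left out of the average.
    """
    counted = [v for v in values if v > 0]
    return round(mean(counted), 1) if counted else None


def category_turns(category):
    """Return (bash_avg, lush_avg) for one category."""
    rows = TURNS[category]
    bash = avg_turns([b for _, b, _ in rows])
    lush = avg_turns([l for _, _, l in rows])
    return bash, lush


if __name__ == "__main__":
    for name in TURNS:
        print(name, category_turns(name))
```

Under this assumption the environment Bash average rests on a single task (env_config), so the 4.0 vs 1.3 gap in that row reflects one failed run rather than three.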

Questionnaire Scores

[Figure: Questionnaire comparison]

Questionnaire Scores by Category

[Figure: Per-category questionnaire]

Agent Turns (Solve Mode)

[Figure: Turns comparison]

Score Difference Heatmap (Lush - Bash)

[Figure: Score heatmap]

Per-Category Breakdown

algorithm

[Figure: algorithm breakdown]

environment

[Figure: environment breakdown]

filesystem

[Figure: filesystem breakdown]

pipeline

[Figure: pipeline breakdown]

process

[Figure: process breakdown]

Per-Task Detail

fizzbuzz [algorithm/solve] bash=PASS lush=PASS

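| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 3 | 5 | +2 |
| Signal-to-noise | 3 | 4 | +1 |
| Familiar conventions | 4 | 5 | +1 |
| Built-in operations | 4 | 5 | +1 |
| String operations | 4 | 4 | 0 |
| Composition | 5 | 3 | -2 |
| I/O ergonomics | 5 | 4 | -1 |
| Data structures | 4 | 4 | 0 |
| Error model | 2 | 3 | +1 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 3 | 5 | +2 |
| Fitness for task | 4 | 4 | 0 |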

reverse_string [algorithm/solve] bash=PASS lush=PASS

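| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 4 | 4 | 0 |
| Signal-to-noise | 5 | 4 | -1 |
| Familiar conventions | 2 | 5 | +3 |
| Built-in operations | 5 | 5 | 0 |
| String operations | 4 | 5 | +1 |
| Composition | 5 | 4 | -1 |
| I/O ergonomics | 5 | 4 | -1 |
| Data structures | 4 | 4 | 0 |
| Error model | 3 | 3 | 0 |
| Edge case support | 3 | 3 | 0 |
| Learnability | 4 | 5 | +1 |
| Fitness for task | 4 | 4 | 0 |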

two_sum [algorithm/solve] bash=PASS lush=PASS

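| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 2 | 3 | +1 |
| Familiar conventions | 3 | 4 | +1 |
| Built-in operations | 4 | 4 | 0 |
| String operations | 3 | 4 | +1 |
| Composition | 4 | 3 | -1 |
| I/O ergonomics | 4 | 4 | 0 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 2 | 2 | 0 |
| Learnability | 3 | 4 | +1 |
| Fitness for task | 2 | 4 | +2 |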

env_config [environment/solve] bash=FAIL lush=PASS

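| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 2 | 3 | +1 |
| Familiar conventions | 2 | 4 | +2 |
| Built-in operations | 3 | 4 | +1 |
| String operations | 3 | 4 | +1 |
| Composition | 4 | 5 | +1 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 3 | 5 | +2 |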

env_path_builder [environment/convert] bash=PASS lush=PASS

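| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 2 | 4 | +2 |
| Familiar conventions | 3 | 4 | +1 |
| Built-in operations | 3 | 3 | 0 |
| String operations | 3 | 3 | 0 |
| Composition | 4 | 3 | -1 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 4 | 4 | 0 |
| Error model | 2 | 3 | +1 |
| Edge case support | 3 | 3 | 0 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 4 | 4 | 0 |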

path_normalizer [environment/convert] bash=PASS lush=PASS

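| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 2 | 4 | +2 |
| Familiar conventions | 2 | 5 | +3 |
| Built-in operations | 2 | 4 | +2 |
| String operations | 3 | 5 | +2 |
| Composition | 4 | 3 | -1 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 3 | 3 | 0 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 3 | 4 | +1 |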

file_organizer [filesystem/solve] bash=FAIL lush=PASS

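| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 3 | 4 | +1 |
| Familiar conventions | 2 | 5 | +3 |
| Built-in operations | 4 | 5 | +1 |
| String operations | 3 | 4 | +1 |
| Composition | 4 | 3 | -1 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 5 | +2 |
| Error model | 2 | 4 | +2 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 4 | 5 | +1 |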

multi_file_search [filesystem/solve] bash=PASS lush=PASS

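| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 4 | 4 | 0 |
| Signal-to-noise | 4 | 3 | -1 |
| Familiar conventions | 3 | 4 | +1 |
| Built-in operations | 5 | 2 | -3 |
| String operations | 4 | 4 | 0 |
| Composition | 5 | 3 | -2 |
| I/O ergonomics | 5 | 4 | -1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 5 | 3 | -2 |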

todo_manager [filesystem/convert] bash=PASS lush=PASS

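| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 2 | 4 | +2 |
| Familiar conventions | 3 | 4 | +1 |
| Built-in operations | 2 | 3 | +1 |
| String operations | 2 | 4 | +2 |
| Composition | 4 | 3 | -1 |
| I/O ergonomics | 4 | 4 | 0 |
| Data structures | 2 | 4 | +2 |
| Error model | 2 | 2 | 0 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 3 | 4 | +1 |
| Fitness for task | 3 | 4 | +1 |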

csv_transform [pipeline/convert] bash=PASS lush=PASS

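| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 3 | 4 | +1 |
| Familiar conventions | 2 | 4 | +2 |
| Built-in operations | 4 | 4 | 0 |
| String operations | 4 | 4 | 0 |
| Composition | 5 | 3 | -2 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 4 | 4 | 0 |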

currency_converter [pipeline/convert] bash=PASS lush=PASS

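| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 2 | 3 | +1 |
| Familiar conventions | 2 | 4 | +2 |
| Built-in operations | 1 | 2 | +1 |
| String operations | 3 | 4 | +1 |
| Composition | 4 | 4 | 0 |
| I/O ergonomics | 4 | 4 | 0 |
| Data structures | 2 | 4 | +2 |
| Error model | 2 | 3 | +1 |
| Edge case support | 3 | 3 | 0 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 2 | 4 | +2 |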

locale_weather_url [pipeline/convert] bash=PASS lush=PASS

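| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 3 | 4 | +1 |
| Familiar conventions | 2 | 4 | +2 |
| Built-in operations | 2 | 5 | +3 |
| String operations | 3 | 5 | +2 |
| Composition | 4 | 4 | 0 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 3 | 4 | +1 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 4 | 5 | +1 |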

log_parser [pipeline/convert] bash=PASS lush=PASS

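| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 3 | 4 | +1 |
| Familiar conventions | 2 | 4 | +2 |
| Built-in operations | 4 | 4 | 0 |
| String operations | 4 | 5 | +1 |
| Composition | 5 | 3 | -2 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 3 | 3 | 0 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 4 | 4 | 0 |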

network_info_parser [pipeline/convert] bash=PASS lush=PASS

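| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 2 | 4 | +2 |
| Familiar conventions | 3 | 5 | +2 |
| Built-in operations | 2 | 5 | +3 |
| String operations | 2 | 5 | +3 |
| Composition | 4 | 4 | 0 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 4 | 5 | +1 |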

pipeline_transform [pipeline/solve] bash=PASS lush=PASS

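| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 3 | 4 | +1 |
| Signal-to-noise | 5 | 3 | -2 |
| Familiar conventions | 2 | 4 | +2 |
| Built-in operations | 5 | 3 | -2 |
| String operations | 4 | 4 | 0 |
| Composition | 5 | 2 | -3 |
| I/O ergonomics | 5 | 4 | -1 |
| Data structures | 4 | 4 | 0 |
| Error model | 2 | 3 | +1 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 3 | 4 | +1 |
| Fitness for task | 5 | 3 | -2 |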

pipeline_word_freq [pipeline/convert] bash=PASS lush=PASS

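| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 4 | 4 | 0 |
| Familiar conventions | 2 | 5 | +3 |
| Built-in operations | 5 | 4 | -1 |
| String operations | 4 | 5 | +1 |
| Composition | 5 | 3 | -2 |
| I/O ergonomics | 5 | 4 | -1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 3 | 3 | 0 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 5 | 4 | -1 |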

url_normalizer [pipeline/convert] bash=PASS lush=PASS

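| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 2 | 4 | +2 |
| Familiar conventions | 3 | 4 | +1 |
| Built-in operations | 2 | 4 | +2 |
| String operations | 3 | 5 | +2 |
| Composition | 4 | 4 | 0 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 4 | +1 |
| Error model | 2 | 3 | +1 |
| Edge case support | 3 | 4 | +1 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 3 | 4 | +1 |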

process_exit_codes [process/solve] bash=PASS lush=PASS

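| Metric | Bash | Lush | Diff |
|---|---|---|---|
| Syntax clarity | 2 | 4 | +2 |
| Signal-to-noise | 3 | 4 | +1 |
| Familiar conventions | 2 | 4 | +2 |
| Built-in operations | 4 | 5 | +1 |
| String operations | 4 | 4 | 0 |
| Composition | 5 | 3 | -2 |
| I/O ergonomics | 4 | 5 | +1 |
| Data structures | 3 | 4 | +1 |
| Error model | 3 | 3 | 0 |
| Edge case support | 2 | 3 | +1 |
| Learnability | 2 | 4 | +2 |
| Fitness for task | 5 | 5 | 0 |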
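
The report does not state how the per-task tables roll up into the Bash Avg Score and Lush Avg Score columns of the Per-Category Summary. Because process contains a single task, its row can be cross-checked directly against the process_exit_codes table. The sketch below assumes the average score is the unweighted mean of the twelve metrics and that Diff is Lush minus Bash, as the heatmap title suggests. The names `SCORES`, `diff_column` and `average` are illustrative only.

```python
# (metric, bash, lush), copied from the process_exit_codes table.
SCORES = [
    ("Syntax clarity", 2, 4),
    ("Signal-to-noise", 3, 4),
    ("Familiar conventions", 2, 4),
    ("Built-in operations", 4, 5),
    ("String operations", 4, 4),
    ("Composition", 5, 3),
    ("I/O ergonomics", 4, 5),
    ("Data structures", 3, 4),
    ("Error model", 3, 3),
    ("Edge case support", 2, 3),
    ("Learnability", 2, 4),
    ("Fitness for task", 5, 5),
]


def diff_column(rows):
    """Diff per metric, assumed to be Lush minus Bash."""
    return {metric: lush - bash for metric, bash, lush in rows}


def average(rows, index):
    """Unweighted mean of one score column (1 = Bash, 2 = Lush)."""
    values = [row[index] for row in rows]
    return sum(values) / len(values)


if __name__ == "__main__":
    print("bash avg:", average(SCORES, 1))  # 39 / 12 = 3.25
    print("lush avg:", average(SCORES, 2))  # 48 / 12 = 4.0
    print("diffs:", diff_column(SCORES))
```

The Bash mean of 3.25 appears as 3.2 in the summary, which is what round-half-to-even formatting would produce. The Lush mean of 4.0 matches exactly.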