INDEX
Explanations
comparative statements indicating difficulty or challenge
instances where the word "harder" is used to describe increasing difficulty or challenges
Negative Logits
Kings          -0.71
reen           -0.69
oin            -0.67
Loft           -0.66
Lights         -0.66
EVA            -0.66
rophe          -0.65
Rus            -0.65
Constantine    -0.65
Monaco         -0.65
Positive Logits
than           1.08
Than           0.85
harder         0.81
resil          0.78
nces           0.76
behaved        0.75
toget          0.75
forgiving      0.74
risky          0.72
scrambled      0.69
Activations Density: 0.023%