INDEX
Explanations
phrases using the term "much"
expressions indicating a high degree of emphasis or significance
Negative Logits
ogi        -0.72
ortmund    -0.72
alez       -0.71
aring      -0.71
yson       -0.71
AMY        -0.68
iere       -0.66
oft        -0.66
orer       -0.65
OA         -0.64
Positive Logits
worthless   0.83
identical   0.82
everywhere  0.81
everything  0.80
nailed      0.80
useless     0.80
everything  0.79
nil         0.75
boils       0.74
intact      0.73
Activations Density 0.065%