Explanations
proper nouns
references to the name "Ross."
Negative Logits
Token          Logit
rious          -0.80
Countdown      -0.72
ACTED          -0.71
rous           -0.64
à¨             -0.62
conspicuous    -0.60
brance         -0.59
lder           -0.59
minded         -0.58
RON            -0.58
Positive Logits
Token          Logit
etti           1.16
iter           1.16
endale         1.06
olini          1.03
enger          1.00
imo            0.99
ign            0.99
bach           0.99
dale           0.95
andra          0.94
Activations Density 0.024%