INDEX
Explanations
pronouns referring to collective groups or individuals
New Auto-Interp
Negative Logits
Tanz
-0.75
Neurolog
-0.65
amiya
-0.63
rose
-0.63
ilial
-0.61
Liang
-0.60
Hayden
-0.59
asionally
-0.58
umerable
-0.58
Garner
-0.58
POSITIVE LOGITS
traction
0.93
bearings
0.83
acquainted
0.83
hooked
0.79
attention
0.78
foothold
0.78
started
0.74
chy
0.74
ready
0.74
juices
0.73
Activations Density 0.115%