INDEX
Explanations
expressions of kindness and community involvement
Negative Logits
anga     -0.16
aris     -0.15
abic     -0.14
LEAN     -0.14
semb     -0.14
pec      -0.14
UBY      -0.14
Bates    -0.14
ès       -0.14
rst      -0.14
Positive Logits
patron       0.16
pac          0.15
opt          0.15
cheon        0.15
troop        0.15
jug          0.15
badly        0.15
Rendering    0.15
allot        0.15
kiye         0.14
Activations Density 0.056%