INDEX
Explanations
words related to movement or physical actions
New Auto-Interp
Negative Logits
++++++++++++++++
-0.62
cavalli
-0.58
memberof
-0.55
仇
-0.54
Britton
-0.52
Mackay
-0.52
bentuk
-0.52
Cohn
-0.52
mayores
-0.51
斌
-0.51
POSITIVE LOGITS
Peek
0.97
snippet
0.90
―――――
0.88
Tikang
0.88
flirt
0.86
Theſe
0.85
snippets
0.84
hinting
0.84
$_"
0.83
Hues
0.83
Activations Density 0.427%