Explanations
references to physical movements or conditions involving the legs
instances of the word "lim" or its variations in various contexts
Negative Logits
ALK       -0.77
EEK       -0.76
Kent      -0.70
YE        -0.68
Brand     -0.64
Watching  -0.64
Citizens  -0.62
ãĥ¯ãĥ³    -0.62
ATA       -0.62
Deadly    -0.61
Positive Logits
lim       1.26
elight    1.01
ptin      0.94
bered     0.92
inished   0.90
ovable    0.89
erick     0.87
corrid    0.87
pless     0.86
ewater    0.84
Activations Density 0.003%