Explanations
the word "roll" or phrases related to rolling actions or mechanisms
Negative Logits
Token      Logit
ãĥ¤        -1.10
ãĤ»        -0.96
bub        -0.93
ãĥŁ        -0.91
orial      -0.89
ILE        -0.88
ãĤ±        -0.87
perse      -0.86
ilities    -0.86
acters     -0.85
Positive Logits
Token      Logit
sheet      1.08
downhill   1.04
roll       1.04
Roll       1.01
aways      1.00
back       0.99
outs       0.98
anut       0.96
roll       0.94
rolling    0.94
Activations Density: 6.145%