INDEX
Negative Logits
produces
-0.08
globals
-0.07
면
-0.07
_do
-0.07
략
-0.07
lanan
-0.07
ксп
-0.07
//.
-0.06
머
-0.06
такого
-0.06
POSITIVE LOGITS
rally
0.17
rallies
0.14
Rally
0.14
rallied
0.12
rallying
0.11
Rory
0.08
ally
0.08
Marty
0.08
rolley
0.07
Tony
0.07
Activations Density 0.002%