NEGATIVE LOGITS
Token         Logit
olon          -0.76
brate         -0.69
brates        -0.67
Dome          -0.63
xual          -0.62
esville       -0.61
Kinnikuman    -0.60
覚醒          -0.59
FIGHT         -0.58
姫            -0.58
POSITIVE LOGITS
Token         Logit
legality      0.77
dubious       0.72
istry         0.71
suspic        0.66
questionable  0.66
CLASSIFIED    0.59
necess        0.58
iferation     0.57
skim          0.57
indisc        0.57
Activations Density 9.430%