INDEX
Negative Logits
ology
-0.64
Ze
-0.61
EMOS
-0.59
miento
-0.59
liek
-0.58
Jaffe
-0.58
Morin
-0.57
Ad
-0.56
Pand
-0.56
LECTIONS
-0.56
POSITIVE LOGITS
</
1.88
)</
1.73
"</
1.53
}</
1.53
."</
1.49
,</
1.44
.</
1.41
;</
1.40
?</
1.39
!</
1.36
Activations Density 0.079%