INDEX
Negative Logits
ĸļ
-0.89
äºĶ
-0.83
¸
-0.81
comings
-0.78
SourceFile
-0.77
undrum
-0.73
nikov
-0.71
forts
-0.70
eway
-0.70
srfAttach
-0.69
POSITIVE LOGITS
they
0.93
there
0.85
you
0.84
we
0.80
someone
0.80
qualifies
0.77
anyone
0.76
he
0.73
technically
0.73
somebody
0.70
Activations Density 0.029%