INDEX
NEGATIVE LOGITS
amily        -0.38
nces         -0.37
eq           -0.36
phas         -0.35
atism        -0.35
Sons         -0.35
ajor         -0.35
anchester    -0.34
faith        -0.33
missing      -0.32
POSITIVE LOGITS
or            0.55
etc           0.50
.''.          0.44
)).           0.43
®,            0.38
.�            0.36
entary        0.36
tray          0.35
unnoticed     0.35
.」           0.35
Activations Density 15.702%