INDEX
Explanations
dates and events in history
instances of entities and their roles or descriptions
New Auto-Interp
Negative Logits
Others
-0.60
goodies
-0.55
thereto
-0.53
alike
-0.51
PHOTOS
-0.51
miscar
-0.50
ibles
-0.49
dads
-0.49
Enlarge
-0.49
politics
-0.47
POSITIVE LOGITS
an
1.27
a
1.22
another
0.89
the
0.75
one
0.73
an
0.69
someone
0.67
another
0.63
a
0.60
something
0.58
Activations Density 0.994%