INDEX
Explanations
dates and current events
New Auto-Interp
Negative Logits
aughed
-0.79
emaker
-0.68
imov
-0.68
henko
-0.68
ashtra
-0.64
rosse
-0.62
essee
-0.61
urat
-0.60
Trouble
-0.60
76561
-0.60
POSITIVE LOGITS
adays
0.98
abouts
0.92
afternoon
0.87
morning
0.85
days
0.76
lights
0.74
evening
0.71
marks
0.70
tics
0.69
here
0.69
Activations Density 0.390%