INDEX
Explanations
years in the 1990s, with a particularly high activation for the year 1990
references to the year 1990 and its context within historical narratives
New Auto-Interp
Negative Logits
hire
-0.78
bow
-0.74
holder
-0.70
lightsaber
-0.69
clipboard
-0.69
edge
-0.69
Edge
-0.68
llers
-0.67
cloth
-0.67
semble
-0.67
POSITIVE LOGITS
ĸļ
1.02
ãĤ¦ãĤ¹
0.74
GOODMAN
0.70
ãĥŁ
0.69
cture
0.68
ãĥ¤
0.65
emonium
0.65
aji
0.64
zsche
0.64
å¹
0.63
Activations Density 0.017%