Explanations
years, specifically targeting the year 1993
mentions of the year 1993
Negative Logits
Tu          -0.70
semb        -0.69
hed         -0.68
lying       -0.65
tro         -0.62
Stream      -0.60
gery        -0.60
bors        -0.59
Adv         -0.58
lightsaber  -0.58
Positive Logits
ĸļ          0.93
å¹          0.85
-'          0.80
theless     0.67
Rwanda      0.65
uncture     0.65
ãĤ¦ãĤ¹      0.65
Leban       0.63
ãĥ£         0.63
Created     0.62
Activations Density: 0.021%