INDEX
Explanations
years mentioned in a text
references to specific years
New Auto-Interp
Negative Logits
acci
-0.81
Skinner
-0.68
maple
-0.66
vertisement
-0.65
bip
-0.63
meaning
-0.58
inance
-0.58
diaper
-0.58
illet
-0.57
icone
-0.57
POSITIVE LOGITS
GV
0.86
Guard
0.70
âĢ¢âĢ¢
0.70
AMY
0.69
é¾į
0.68
TED
0.68
ã쮿
0.65
Nicol
0.64
IVERS
0.64
Wars
0.64
Activations Density 0.176%