INDEX
Explanations
phrases related to historical information, particularly about events or facts
references to historical events or records
Negative Logits
Token       Logit
umbers      -0.75
emouth      -0.73
atron       -0.71
uters       -0.70
ople        -0.68
uple        -0.68
aband       -0.67
tackle      -0.65
hement      -0.65
cliffe      -0.64
Positive Logits
Token       Logit
Origins     0.87
Background  0.84
========    0.84
origins     0.82
Early       0.81
Past        0.80
Orig        0.78
────────    0.77
Spoiler     0.77
Appearance  0.71
Activations Density: 0.086%