INDEX
Explanations
numerical values preceded by "ago"
the frequency of the word "ago" in various contexts
New Auto-Interp
Negative Logits
mathemat
-0.86
suspic
-0.78
plaus
-0.76
glim
-0.72
uniqueness
-0.71
simultaneous
-0.69
illumination
-0.67
tremend
-0.67
intuitive
-0.66
mobility
-0.66
POSITIVE LOGITS
vernment
1.40
zzi
1.05
zzo
0.99
etta
0.95
onga
0.95
ago
0.91
xon
0.91
allo
0.87
asca
0.87
Mata
0.87
Activations Density 0.007%