INDEX
Explanations
words related to prevalence or frequency
instances of the word "common" and its variations in different contexts
New Auto-Interp
Negative Logits
mosp
-0.88
zik
-0.80
usalem
-0.75
etry
-0.70
=-=-
-0.70
forestation
-0.70
uana
-0.69
ophon
-0.67
oÄŁ
-0.67
ignt
-0.66
POSITIVE LOGITS
occurrence
1.02
alities
1.01
occurrences
1.01
places
0.98
ality
0.95
wealth
0.94
denomin
0.90
place
0.77
pmwiki
0.74
Sym
0.72
Activations Density 0.029%