INDEX
Explanations
the frequency or repetition of the word "many" in different contexts
New Auto-Interp
Negative Logits
UAL
-0.76
anut
-0.73
icism
-0.71
OPE
-0.69
agame
-0.68
istan
-0.67
ngth
-0.67
oulos
-0.65
atum
-0.65
ACTED
-0.64
POSITIVE LOGITS
facets
1.09
times
1.04
aspects
1.02
thousands
0.94
kinds
0.94
different
0.93
body
0.86
thousand
0.86
unanswered
0.85
occasions
0.85
Activations Density 0.067%