INDEX
Explanations
instances of the word "one" followed by a number indicating a level of importance or priority
instances of the word "one" in various contexts
New Auto-Interp
Negative Logits
ooks
-0.74
actionGroup
-0.71
ãĥ©ãĥ³
-0.69
ories
-0.66
oof
-0.66
ãĤ¬
-0.65
Available
-0.64
emies
-0.64
inders
-0.63
ãģĤ
-0.62
POSITIVE LOGITS
hundred
1.06
Hundred
0.89
thousand
0.89
thing
0.86
person
0.82
sided
0.80
glance
0.80
wonders
0.79
esan
0.79
embodiment
0.78
Activations Density 0.106%