INDEX
Explanations
phrases containing the word "one" followed by an adjective
the word "one" in various contexts
Negative Logits
ooks         -0.70
ourn         -0.64
gif          -0.63
cats         -0.63
older        -0.62
hips         -0.62
ories        -0.58
ãĤ¢          -0.57
ãĤ¬          -0.57
NZ           -0.56
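The two entries ãĤ¢ and ãĤ¬ in the negative list are not necessarily corrupted text. They look like raw byte-level BPE token strings, where each character stands for one byte of a UTF-8 sequence. The sketch below assumes the vocabulary uses the GPT-2 style byte-to-character mapping (an assumption; the page does not name the tokenizer). The helper names `byte_to_char_table` and `decode_token` are hypothetical.

```python
def byte_to_char_table():
    """Rebuild the GPT-2 style byte -> printable character table (assumed mapping)."""
    # Bytes that already print cleanly map to themselves.
    printable = (
        list(range(0x21, 0x7F))     # '!' .. '~'
        + list(range(0xA1, 0xAD))   # U+00A1 .. U+00AC
        + list(range(0xAE, 0x100))  # U+00AE .. U+00FF
    )
    table = {b: chr(b) for b in printable}
    # Every remaining byte is shifted to code point 256 + n, in byte order.
    n = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + n)
            n += 1
    return table


# Invert the table: dashboard character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in byte_to_char_table().items()}


def decode_token(token: str) -> str:
    """Map each character back to its byte, then decode the bytes as UTF-8."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("\u00e3\u0124\u00a2"))  # the token shown as ãĤ¢ -> ア
print(decode_token("\u00e3\u0124\u00ac"))  # the token shown as ãĤ¬ -> ガ
print(decode_token("ooks"))                # plain ASCII tokens are unchanged
```

Under that assumption, the two entries decode to the katakana characters ア and ガ.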
Positive Logits
hundred      0.94
Hundred      0.83
sided        0.79
embodiment   0.79
wonders      0.78
thousand     0.77
thing        0.74
dimensional  0.74
crore        0.66
esan         0.66
Activations Density 0.122%
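As a rough reading of the density figure, the sketch below assumes that "activations density" means the fraction of sampled tokens on which the feature has a nonzero activation. The page does not define the term, so this is an assumption, and `activation_density` is a hypothetical helper, not part of any dashboard API.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold (assumed definition)."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy example: the feature fires on 1 of 4 tokens.
print(activation_density([0.0, 0.0, 1.5, 0.0]))  # 0.25

# The reported figure, converted from a percentage to a fraction.
density = 0.122 / 100
tokens_per_activation = 1 / density
print(round(tokens_per_activation))  # 820
```

On that reading, a density of 0.122% corresponds to the feature firing on roughly one token in every 820.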