INDEX
Explanations
adverbs that describe actions
the word "so" in various contexts
Negative Logits
glances      -0.65
wreck        -0.63
realities    -0.60
gallery      -0.60
inch         -0.58
cradle       -0.58
disbelief    -0.58
silhouette   -0.58
ropolitan    -0.58
acre         -0.57
Positive Logits
bered        1.19
oths         1.12
othes        1.07
apy          1.05
aps          0.99
othe         0.99
oner         0.97
oooo         0.93
ooo          0.91
zin          0.90
Activations Density 0.095%