INDEX
Explanations
words related to decisions and actions
phrases and references to nostalgia and past experiences
Negative Logits
ãĥīãĥ©    -0.60
gang      -0.56
ammy      -0.52
rocal     -0.51
ideshow   -0.50
Moines    -0.50
pires     -0.50
ys        -0.49
iple      -0.48
mx        -0.48
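The first negative-logit token, ãĥīãĥ©, looks like encoding debris but is more likely a byte-level BPE token shown in its raw vocabulary form. The sketch below decodes such strings, assuming the vocabulary uses the GPT-2-style byte-to-character table (the page does not name the model or tokenizer, so this is an assumption). `decode_token` is a hypothetical helper name, not part of any library.

```python
def bytes_to_unicode():
    """Build the byte -> printable-character table used by GPT-2-style byte-level BPE.

    Assumption: the dashboard's tokens come from a vocabulary that uses this table.
    """
    # Bytes that are already printable map to themselves.
    byte_values = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    code_points = byte_values[:]
    offset = 0
    # Every remaining byte is shifted to code point 256, 257, ... in byte order.
    for b in range(256):
        if b not in byte_values:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Hypothetical helper: turn a raw vocabulary string back into readable text."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # A single token may end mid-character, so replace undecodable bytes.
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥīãĥ©"))  # → ドラ
```

Under that assumption the token is the katakana pair ドラ. Plain ASCII tokens such as gang pass through unchanged, and a leading Ġ in a raw vocabulary string decodes to a space.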
Positive Logits
namely        1.25
especially    1.07
hence         0.98
albeit        0.93
particularly  0.93
culminating   0.93
especially    0.93
whereas       0.90
Especially    0.89
although      0.88
Activations Density 0.537%