INDEX
Explanations
words related to future plans or intentions
terms related to planning and envisioning future scenarios
New Auto-Interp
Negative Logits
unsupported
-0.69
panic
-0.69
capit
-0.67
exit
-0.65
Brit
-0.65
americ
-0.65
whims
-0.64
Panic
-0.63
whining
-0.62
idiots
-0.62
POSITIVE LOGITS
ued
1.15
ues
1.12
elled
1.11
eled
1.11
icked
1.10
aled
1.09
·
1.06
uted
1.05
ved
1.02
ining
1.01
Activations Density 0.116%