INDEX
Explanations
phrases related to falling or being off track/down
instances of personal transformation or emotional experiences
New Auto-Interp
Negative Logits
prominently
-0.81
roundup
-0.79
Offic
-0.71
cheaply
-0.69
SourceFile
-0.67
reportedly
-0.64
expensive
-0.64
proudly
-0.63
inspected
-0.63
Thousands
-0.61
POSITIVE LOGITS
conclusions
1.01
illusion
0.97
enment
0.92
EStreamFrame
0.91
thinking
0.89
izons
0.83
athetic
0.78
toes
0.78
understanding
0.77
direction
0.77
Activations Density 0.406%