INDEX
Explanations
phrases starting with "To"
phrases indicating intention or purpose
New Auto-Interp
Negative Logits
rets
-0.74
pretty
-0.67
kind
-0.65
SpaceEngineers
-0.63
uments
-0.63
ãģł
-0.62
minster
-0.61
miss
-0.60
rage
-0.59
hest
-0.59
POSITIVE LOGITS
researchers
0.75
psychologists
0.69
physicists
0.69
however
0.69
filmmakers
0.67
we
0.67
astronomers
0.67
moreover
0.66
Forbes
0.66
planners
0.64
Activations Density 0.215%