Explanations
terms related to direct actions or relationships
instances of the word "direct" indicating various forms of direct interaction or action
Negative Logits
Norn          -0.76
Cursed        -0.72
Peb           -0.71
nesota        -0.69
abies         -0.69
Mell          -0.67
Face          -0.67
Alive         -0.65
Disabled      -0.65
Petersburg    -0.64
Positive Logits
direct        1.11
indirect      1.01
sunlight      0.90
irect         0.79
forward       0.76
orial         0.75
htaking       0.75
direct        0.75
cuts          0.74
distribut     0.74
Activations Density: 0.010%