INDEX
Explanations
descriptions of outcomes or results
New Auto-Interp
Negative Logits
osponsors
-0.67
udi
-0.65
Passage
-0.65
blat
-0.64
don
-0.64
actionGroup
-0.63
NetMessage
-0.62
conservancy
-0.62
afort
-0.62
hare
-0.61
POSITIVE LOGITS
iveness
0.97
thereof
0.94
ivity
0.85
ively
0.81
result
0.75
ivism
0.73
of
0.72
ivist
0.72
aries
0.71
arian
0.69
Activations Density 0.042%