INDEX
Explanations
specific instances or actions mentioned in a document
New Auto-Interp
Negative Logits
natureconservancy
-0.84
inventoryQuantity
-0.82
Alive
-0.74
iddler
-0.68
ylon
-0.67
alive
-0.65
urity
-0.64
ingham
-0.63
Plain
-0.62
Trance
-0.61
POSITIVE LOGITS
toward
1.31
towards
1.21
downwards
0.94
irection
0.93
Towards
0.93
rils
0.92
ggle
0.87
squarely
0.86
downward
0.84
ges
0.83
Activations Density 1.092%