INDEX
Explanations
phrases related to organization restructuring and categorization
NEGATIVE LOGITS
tar         -0.74
fortun      -0.70
linger      -0.68
insulted    -0.66
nz          -0.65
warning     -0.64
entimes     -0.63
tor         -0.63
press       -0.63
echo        -0.63
POSITIVE LOGITS
manageable   1.04
thirds       0.97
categories   0.92
chunks       0.90
phases       0.89
segments     0.87
discrete     0.86
halves       0.85
Pieces       0.85
pieces       0.84
Activations Density: 0.051%