INDEX
Explanations
words related to significant changes or reorganizations
terms related to transformation or restructuring
New Auto-Interp
Negative Logits
erity
-0.73
spoilers
-0.65
bribes
-0.64
gom
-0.63
abst
-0.62
osity
-0.60
hereby
-0.59
foul
-0.59
iaries
-0.59
ously
-0.58
POSITIVE LOGITS
figured
0.96
fig
0.88
itialized
0.86
FORMATION
0.84
aping
0.82
cend
0.81
facing
0.81
formation
0.79
ital
0.78
ension
0.78
Activations Density 0.056%