INDEX
Explanations
words related to slight adjustments or changes
incremental changes or modifications
New Auto-Interp
Negative Logits
nor
-0.78
dest
-0.73
all
-0.72
ahu
-0.71
anca
-0.68
ios
-0.67
mining
-0.67
ondon
-0.66
Nations
-0.65
tics
-0.65
POSITIVE LOGITS
slightly
3.16
marginally
2.27
somewhat
2.05
mildly
1.79
considerably
1.78
lightly
1.77
moderately
1.74
slight
1.74
Slightly
1.74
faintly
1.65
Activations Density 0.017%