INDEX
Explanations
words related to making changes or modifications
terms related to modifications or changes
New Auto-Interp
Negative Logits
enic
-0.80
Clancy
-0.71
gil
-0.70
Vide
-0.67
VP
-0.67
ILE
-0.66
Chrys
-0.66
ardy
-0.65
jay
-0.65
ocry
-0.64
POSITIVE LOGITS
adjustment
1.50
adjustments
1.40
adjust
1.10
adjust
1.10
adjusting
1.03
Adjust
0.98
thresholds
0.95
aution
0.95
adjusts
0.93
Adjust
0.89
Activations Density 0.005%