INDEX
Explanations
sentences expressing the idea of potential outcomes or conditions
New Auto-Interp
Negative Logits
ĺ
-0.69
iat
-0.69
wine
-0.68
ires
-0.67
Shift
-0.66
don
-0.65
bands
-0.62
Lines
-0.62
Lives
-0.61
Month
-0.61
POSITIVE LOGITS
fulfilling
1.04
achieving
0.98
guaranteeing
0.97
eliminating
0.93
resolving
0.93
justifying
0.90
agreeing
0.90
acknowledging
0.89
completing
0.87
fruition
0.87
Activations Density 0.047%