INDEX
Explanations
verbs related to affirming or confirming statements
phrases related to reaffirming commitments or positions
New Auto-Interp
Negative Logits
Lup
-0.75
AH
-0.75
OTA
-0.74
Created
-0.71
Galile
-0.69
Gray
-0.68
Golem
-0.67
Tul
-0.67
Hungry
-0.67
Leary
-0.66
POSITIVE LOGITS
irmed
1.43
irming
1.28
reaff
1.25
irmation
1.25
irms
1.13
ront
1.05
licted
0.96
uates
0.91
bourg
0.90
irm
0.88
Activations Density 0.006%