INDEX
Explanations
terms related to discussions and debates
New Auto-Interp
Negative Logits
</i>
-0.79
но
-0.73
er
-0.71
Abp
-0.70
o
-0.69
Dina
-0.68
zu
-0.67
ca
-0.65
co
-0.64
o
-0.64
POSITIVE LOGITS
debate
1.50
debate
1.48
Debate
1.36
Debate
1.34
debates
1.33
debated
1.33
itſelf
1.33
DEB
1.29
houſe
1.25
myſelf
1.25
Activations Density 0.079%