INDEX
Explanations
instances where the word "wouldn't" appears, indicating hypothetical contradictions or denials
New Auto-Interp
Negative Logits
存于互联网档案馆
-0.58
arşivlendi
-0.57
یری
-0.51
Hiller
-0.51
edig
-0.51
damska
-0.50
Sleep
-0.50
Bucure
-0.49
sleep
-0.49
eclared
-0.49
POSITIVE LOGITS
would
0.97
WOULD
0.90
wouldnt
0.87
would
0.80
ⓧ
0.78
nakalista
0.76
wouldn
0.75
zou
0.75
Would
0.75
disambiguazione
0.74
Activations Density 0.293%