INDEX
Explanations
German words, specifically related to politics and possibly diplomacy
fragmented or incomplete phrases that are indicative of uncertainty or hesitation
New Auto-Interp
Negative Logits
bending
-0.72
TTC
-0.71
ACP
-0.70
Phoenix
-0.69
independence
-0.68
advertisement
-0.66
streetcar
-0.66
PC
-0.65
Toronto
-0.65
Canadian
-0.65
POSITIVE LOGITS
dies
1.20
die
1.19
Sie
1.12
aber
1.11
von
1.08
beit
1.07
der
1.07
Die
1.04
Das
1.04
Bundes
1.03
Activations Density 0.076%