INDEX
Explanations
phrases indicating reported speech or citations
Negative Logits
Token        Logit
gab          -0.55
pers         -0.54
Wer          -0.50
Wer          -0.48
off          -0.46
mittel       -0.45
isc          -0.45
Department   -0.45
bau          -0.45
ork          -0.44
Positive Logits
Token        Logit
saying       1.18
saying       1.17
Saying       1.14
Saying       1.11
say          1.10
SAY          1.08
say          1.08
SAY          1.06
says         1.04
Says         1.00
Activations Density 0.171%
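
The density figure above can be read as the share of sampled tokens on which the feature fires. The sketch below is a minimal illustration under that assumption. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and do not come from the dashboard; the dashboard's actual sampling procedure is not stated here.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of sampled tokens on which
    the feature is active, expressed as a percentage.
    """
    if len(activations) == 0:
        raise ValueError("need at least one activation value")
    # Count tokens where the feature fires (activation above threshold).
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: the feature fires on 1 of 4 tokens.
sample = [0.0, 0.0, 0.5, 0.0]
print(f"{activation_density(sample):.3f}%")  # prints 25.000%
```

Under this reading, a density of 0.171% would correspond to the feature firing on roughly 1.7 of every 1,000 sampled tokens.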