INDEX
Explanations
phrases that reference outcomes or conclusions
result is/of
New Auto-Interp
Negative Logits
zwiſchen
-0.73
beſch
-0.68
niſſe
-0.67
ſchen
-0.65
feroit
-0.63
verſch
-0.61
fashiola
-0.61
ſchaft
-0.61
dieſem
-0.60
queſto
-0.60
POSITIVE LOGITS
result
1.01
Ergebnis
0.78
result
0.77
RESULT
0.77
outcome
0.77
Result
0.76
resultado
0.75
resulting
0.74
results
0.73
Resultat
0.68
Activations Density 0.038%