INDEX
Explanations
conclusory phrases or summarizing statements in the text
Negative Logits
rokken       -0.58
pourtant     -0.56
itchfield    -0.54
model        -0.53
Sociales     -0.50
actionBar    -0.49
%)$          -0.49
truction     -0.48
untura       -0.47
porus        -0.47
Positive Logits
Lastly           0.98
lastly           0.95
Lastly           0.95
Finally          0.88
Finally          0.87
ostavi           0.86
Obrázky          0.85
Personendaten    0.81
finally          0.81
finally          0.78
Activations Density 0.162%
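
The source does not define the density figure. A common reading, assumed here, is the fraction of sampled token positions on which the feature's activation is above zero. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which
    the feature fires at all (activation > 0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 162 of them active.
sample = [1.0] * 162 + [0.0] * (100_000 - 162)
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.162% means the feature fires on roughly 1 in every 600 tokens, which fits a feature keyed to sparse discourse markers such as "Lastly" and "Finally".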