INDEX
Explanations
time durations and temporal markers in the text
Negative Logits
pensiero    -0.36
darbu       -0.35
darb        -0.35
depart      -0.35
restre      -0.34
adering     -0.33
kautta      -0.33
asamblea    -0.32
mondta      -0.32
Dyck        -0.32
Positive Logits
Personendaten       0.57
httphttps           0.52
invokeLater         0.52
autorytatywna       0.52
الحياه              0.49
haviors             0.49
increí              0.49
Italijanski         0.48
EndContext          0.48
MigrationBuilder    0.48
Activations Density: 0.011%