INDEX
Explanations
phrases indicating unexpected outcomes or surprises in narratives
New Auto-Interp
Negative Logits
ⓧ
-0.60
OFDb
-0.52
WithIOException
-0.51
condicion
-0.50
slee
-0.48
adh
-0.48
Wikimedijinoj
-0.47
contextLoads
-0.47
bones
-0.47
Personensuche
-0.47
POSITIVE LOGITS
Apparently
0.84
Apparently
0.81
apparently
0.76
Ternyata
0.67
apparently
0.67
Seems
0.59
ternyata
0.59
Evidently
0.57
blijkt
0.56
原來
0.54
Activations Density 0.251%