INDEX
Explanations
negative sentiments or experiences related to relationships
stories that are recounted rather than events that happened
tokens that occur at the start of a sentence or turn (sentence/turn-initial words and markers).
Negative Logits
nahilalakip    -0.53
فريبيس    -0.53
spesa    -0.51
/\.    -0.50
comuniques    -0.47
ValueStyle    -0.47
Cyfeiriadau    -0.46
Italijani    -0.46
ifikationer    -0.45
Попис    -0.44
Positive Logits
recounts    0.45
Roskov    0.40
stories    0.39
recounted    0.39
awaiter    0.38
tales    0.38
story    0.37
nu    0.37
relatos    0.37
tale    0.36
Activations Density 0.049%