INDEX
Explanations
references to specific characters or actions occurring in a narrative context
Unusual situations or observations
saw defendant or pile of
New Auto-Interp
Negative Logits
apunov
-0.52
szak
-0.50
impuesto
-0.47
icoli
-0.46
vált
-0.46
irchen
-0.45
impuestos
-0.44
gynhyrchwyd
-0.43
ổi
-0.43
心中的
-0.43
POSITIVE LOGITS
seemed
0.74
something
0.66
Someone
0.66
someone
0.66
Looked
0.65
+#+
0.64
Someone
0.64
похоже
0.64
strangely
0.63
suddenly
0.62
Activations Density 0.207%