INDEX
Explanations
phrases that indicate unexpected changes or events
New Auto-Interp
Negative Logits
partea
-0.62
présidentielle
-0.59
pyrene
-0.59
Poo
-0.59
marito
-0.59
ritratto
-0.55
estoppel
-0.55
@[
-0.55
setShow
-0.54
frutto
-0.54
POSITIVE LOGITS
suddenly
1.66
sudden
1.63
Sudden
1.52
Sudden
1.52
Suddenly
1.43
Suddenly
1.42
suddenly
1.40
soudain
1.25
突然
1.15
abrupt
1.13
Activations Density 0.113%