INDEX
Explanations
quotations and statements made by individuals
quotes or speech from characters
New Auto-Interp
Negative Logits
rats
-0.75
ģ«
-0.67
effected
-0.67
respective
-0.61
tainted
-0.60
impact
-0.60
levant
-0.60
代
-0.57
hijacked
-0.57
corruption
-0.56
POSITIVE LOGITS
chuck
0.90
recalls
0.84
laughs
0.80
Laughs
0.79
remembers
0.78
pauses
0.75
recalling
0.74
laughs
0.73
shrug
0.69
toggle
0.69
Activations Density 0.653%