INDEX
NEGATIVE LOGITS
timing       0.95
延迟         0.88
pausing      0.86
Timing       0.83
delayed      0.83
flashbacks   0.77
doodles      0.76
pause        0.76
ಕಾಲ          0.76
refrained    0.75
POSITIVE LOGITS
лта             0.79
কা              0.79
길              0.78
atrocious       0.77
excessive       0.77
unwarrant       0.76
unnecessarily   0.76
excessively     0.76
сто             0.75
最              0.75
Activations Density 0.029%