INDEX
Explanations
instances of urgency or importance in discussions
Negative Logits
peater       -0.17
Lug          -0.16
urg          -0.15
æ²Ļ          -0.14
fly          -0.14
arel         -0.14
usty         -0.14
pled         -0.14
Nicholson    -0.14
_PW          -0.14
Positive Logits
pekt          0.16
cname         0.15
retty         0.15
ronic         0.15
903           0.15
pek           0.15
974           0.15
sphere        0.14
bild          0.14
034           0.14
Activations Density 0.007%