INDEX
Explanations
phrases indicating requirements or necessities
New Auto-Interp
Negative Logits
etros
-0.14
mdb
-0.14
ylko
-0.13
uarios
-0.13
romium
-0.13
ınca
-0.13
ancock
-0.13
rypton
-0.13
ekyll
-0.13
rá»Ļng
-0.13
POSITIVE LOGITS
exactly
0.23
precisely
0.21
právÄĽ
0.20
именно
0.20
perfectly
0.18
literal
0.17
Exactly
0.17
Ñģаме
0.17
totiž
0.16
literally
0.16
Activations Density 0.079%