INDEX
Explanations
words that indicate meanings or definitions related to cultural or historical contexts
New Auto-Interp
Negative Logits
icopt
-0.14
achinery
-0.14
zburg
-0.14
egov
-0.13
pha
-0.13
antal
-0.13
orses
-0.13
ocab
-0.12
Drain
-0.12
yles
-0.12
POSITIVE LOGITS
literally
0.29
lit
0.26
liter
0.21
lit
0.21
meaning
0.20
Liter
0.20
translated
0.19
meaning
0.19
translated
0.17
Lit
0.17
Activations Density 0.082%