INDEX
Explanations
references to tradition or traditional practices
New Auto-Interp
Negative Logits
aq
-0.15
manship
-0.14
es
-0.14
erra
-0.14
idel
-0.14
Descriptors
-0.14
ãģ¹ãģį
-0.14
roughly
-0.14
opc
-0.14
/he
-0.14
POSITIVE LOGITS
ists
0.21
ized
0.20
itionally
0.19
izing
0.17
ization
0.17
/original
0.17
izes
0.17
ize
0.16
ised
0.16
zie
0.16
Activations Density 0.033%