INDEX
Explanations
concepts related to morality and virtue
New Auto-Interp
Negative Logits
صÙĨ
-0.15
_LVL
-0.15
resh
-0.15
หาร
-0.14
Marketable
-0.14
VEC
-0.14
å§ij
-0.14
sono
-0.14
enburg
-0.14
imson
-0.14
POSITIVE LOGITS
pets
0.17
eto
0.17
ิà¹ī
0.15
stdcall
0.15
fully
0.15
ajas
0.15
contr
0.14
textTheme
0.13
deltas
0.13
acer
0.13
Activations Density 0.009%