Explanations
references to ownership or possession in relation to entities or concepts
Negative Logits
Token     Logit
Squ       -0.16
681       -0.15
agi       -0.15
ディア    -0.14
choice    -0.13
device    -0.13
омер      -0.13
hone      -0.13
Choice    -0.13
بشر       -0.13
Positive Logits
Token        Logit
esser        0.20
contents     0.18
contents     0.18
ilk          0.17
ponents      0.16
ogui         0.16
ammers       0.15
predecessor  0.15
tics         0.15
VIC          0.15
Activations Density: 0.271%