INDEX
Explanations
connections and contrasts between entities or ideas
New Auto-Interp
Negative Logits
autorytatywna
-0.88
ImageContext
-0.86
таратура
-0.82
Rüyada
-0.81
समीक्षाओं
-0.80
彿
-0.79
للمعارف
-0.79
nonUne
-0.78
Portail
-0.77
كومونز
-0.76
POSITIVE LOGITS
ſelf
0.62
himſelf
0.58
myſelf
0.56
itſelf
0.52
ſel
0.51
paſſ
0.50
CONDU
0.50
perſon
0.49
laſt
0.49
]=-
0.48
Activations Density 0.194%