INDEX
Explanations
terms related to different viewpoints and perspectives
Negative Logits
nan      -0.16
بد       -0.16
essler   -0.16
ery      -0.15
lum      -0.15
ampion   -0.14
é¾Ħ      -0.14
dy       -0.14
Dawson   -0.14
wick     -0.14
Positive Logits
view     0.22
views    0.21
ively    0.19
-view    0.19
(view    0.18
:view    0.17
views    0.17
pective  0.17
eking    0.17
ally     0.16
Activations Density 0.021%