Explanations
terms related to curiosity
terms related to curiosity and a desire for knowledge or exploration
Negative Logits
thren       -0.80
nesium      -0.76
anas        -0.74
ergy        -0.73
yna         -0.71
lim         -0.70
mits        -0.68
oran        -0.68
ãĥİ         -0.66
literally   -0.66
Positive Logits
curiosity   1.56
iously      0.86
puzz        0.79
iosity      0.76
mong        0.75
onlook      0.72
sear        0.72
curious     0.71
vier        0.71
Cub         0.70
Activations Density: 0.011%