Explanations
phrases related to providing information or instructions
sections of text that introduce lists or enumerations
Negative Logits
ãĥı                 -0.74
ointed              -0.61
natureconservancy   -0.58
angering            -0.58
roud                -0.56
hya                 -0.55
ãĤ¹ãĥĪ              -0.55
cill                -0.54
Ern                 -0.54
ãĥ¼ãĥĨ              -0.54
Positive Logits
are                 0.96
Thumbnails          0.92
neath               0.87
is                  0.86
ground              0.80
summarizes          0.79
allery              0.77
depicts             0.74
screenshot          0.74
above               0.71
Activations Density 0.042%