INDEX
Explanations
phrases related to geographical or environmental descriptions
New Auto-Interp
Negative Logits
anker
-0.15
Cliff
-0.14
quantitative
-0.14
Closed
-0.14
chez
-0.13
-oper
-0.13
atten
-0.13
chod
-0.13
atel
-0.13
ehen
-0.13
POSITIVE LOGITS
lying
0.29
lie
0.25
lies
0.24
Stretch
0.23
lies
0.23
astr
0.23
border
0.21
Stretch
0.20
stretch
0.20
stretching
0.20
Activations Density 0.175%