INDEX
Explanations
references to elevators and related infrastructure
New Auto-Interp
Negative Logits
تب
-0.17
оÑĢод
-0.16
Fu
-0.15
cầm
-0.14
adius
-0.14
ynam
-0.14
undo
-0.14
Ĥ
-0.14
odia
-0.14
emi
-0.13
POSITIVE LOGITS
elevator
0.42
Elev
0.35
elev
0.34
lifts
0.29
Lift
0.29
lift
0.29
lift
0.26
levator
0.25
.lift
0.20
thang
0.19
Activations Density 0.053%