INDEX
Explanations
terms related to fossil fuels
New Auto-Interp
Negative Logits
esp
-0.07
aday
-0.06
066
-0.06
Goose
-0.06
ongsTo
-0.06
ittings
-0.06
istrovstvÃŃ
-0.06
lessly
-0.06
ãĥ©ãĥ³ãĤ¹
-0.06
erval
-0.06
POSITIVE LOGITS
Kaynak
0.07
/ros
0.07
ilere
0.07
Як
0.07
fuels
0.07
è³
0.07
yıldır
0.07
nhiên
0.07
ifer
0.06
Zu
0.06
Activations Density 0.001%