Explanations
phrases related to drive or propulsion
Negative Logits
Token  Logit
็จ  -0.63
ktop  -0.61
الحياه  -0.56
typelib  -0.54
ounted  -0.54
ContentAlignment  -0.53
mpo  -0.53
hwar  -0.52
osó  -0.52
jard  -0.51
Positive Logits
Token  Logit
driven  1.82
Driven  1.74
Driven  1.67
driven  1.56
Powered  1.55
powered  1.54
powered  1.40
Powered  1.39
fuelled  1.20
fueled  1.19
Activations Density 0.140%
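
As a rough illustration of what a density figure like this could mean, the sketch below assumes density is the fraction of sampled token positions on which the feature has a nonzero activation. That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; they are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumed definition: density = (# active positions) / (# sampled positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 5000 token positions, 7 of which activate the feature.
sample = [0.0] * 4993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.7, 1.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.140% would correspond to the feature firing on roughly 14 out of every 10,000 sampled tokens, which is consistent with a sparse feature tied to a narrow set of words.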