INDEX
Explanations
phrases related to information sharing and transfer
New Auto-Interp
Negative Logits
xdb
-0.17
inherits
-0.17
ÑĤин
-0.16
gom
-0.15
uç
-0.15
-END
-0.15
ahren
-0.15
ãģıãĤĭ
-0.14
ãģ«ãģªãĤĭ
-0.14
tradi
-0.13
POSITIVE LOGITS
already
0.85
already
0.74
Already
0.72
Already
0.68
å·²ç»ı
0.63
å·²
0.59
å·²
0.53
_already
0.52
Ñĥже
0.52
sudah
0.51
Activations Density 0.590%