INDEX
Explanations
phrases that indicate comparisons or similarities between different subjects or findings
New Auto-Interp
Negative Logits
numberWith
-0.65
яко
-0.56
LocalizedString
-0.55
autés
-0.54
hon
-0.54
jScrollPane
-0.53
mpto
-0.53
reszcie
-0.52
Bronnen
-0.52
án
-0.52
POSITIVE LOGITS
similarly
1.08
similar
1.01
same
1.01
same
0.96
同样的
0.95
Same
0.92
Same
0.89
Similarly
0.87
Similar
0.87
similar
0.84
Activations Density 0.426%