INDEX
Explanations
content related to scientific measurements or analysis
after specific nouns
New Auto-Interp
Negative Logits
للاسماء
-0.45
rosis
-0.45
&___
-0.44
Signalez
-0.43
상세
-0.43
تكبرها
-0.43
🟤
-0.43
ươi
-0.42
corações
-0.42
<>",
-0.42
POSITIVE LOGITS
beiden
0.98
どちらも
0.98
both
0.96
ambos
0.93
Both
0.92
Both
0.90
both
0.90
entrambi
0.89
beide
0.86
begge
0.85
Activations Density 1.018%