INDEX
Explanations
names of specific brands, companies, and individuals
New Auto-Interp
Negative Logits
âĶĢâĶĢâĶĢâĶĢ
-0.72
etheless
-0.68
separatist
-0.67
divided
-0.65
depreciation
-0.65
silence
-0.64
disputed
-0.63
pseudo
-0.63
heightened
-0.62
charge
-0.62
POSITIVE LOGITS
ona
1.03
onia
1.01
acia
0.99
ava
0.98
avia
0.98
oya
0.97
inda
0.96
onda
0.94
ora
0.94
inia
0.94
Activations Density 0.426%