INDEX
Explanations
concepts related to reality versus perception
New Auto-Interp
Negative Logits
idopsis
-0.77
Alban
-0.72
rąg
-0.67
chiato
-0.63
PON
-0.62
electrónico
-0.58
gebracht
-0.57
mallows
-0.57
Morde
-0.56
bench
-0.56
POSITIVE LOGITS
Reality
1.38
reality
1.35
Reality
1.33
realities
1.29
reality
1.24
]';
1.12
الواقع
1.08
realidade
0.98
)";
0.96
realty
0.95
Activations Density 0.079%