INDEX
Explanations
philosophical inquiries about the nature of understanding and existence
New Auto-Interp
Negative Logits
acidade
-0.48
̈́
-0.46
uitz
-0.46
ScopeManager
-0.45
odw
-0.43
HFILL
-0.43
請繼續往下閱讀
-0.42
laaj
-0.42
druž
-0.42
Paglinawan
-0.41
POSITIVE LOGITS
things
1.03
something
0.98
thing
0.94
Things
0.92
anything
0.92
anything
0.91
something
0.90
everything
0.86
things
0.85
everything
0.84
Activations Density 0.954%