INDEX
Explanations
references to philosophical and metaphysical concepts, alongside excerpts containing complex and intellectual discussions
New Auto-Interp
Negative Logits
Cookie
-0.81
Klu
-0.68
behavi
-0.68
positives
-0.67
stunts
-0.66
ebin
-0.66
ADS
-0.66
Kiw
-0.66
ãĥ¯ãĥ³
-0.66
banana
-0.65
POSITIVE LOGITS
âĢ¢âĢ¢
1.02
§
0.97
Pg
0.91
á¸
0.87
Footnote
0.85
âĢł
0.84
—"
0.81
slave
0.80
[
0.80
âĢij
0.79
Activations Density 5.643%