INDEX
Explanations
mentions of the word "Nap" in various contexts
references to "Napoleon" or related terms
New Auto-Interp
Negative Logits
Gemini
-0.91
DonaldTrump
-0.77
UCT
-0.75
Carbuncle
-0.70
×Ļ×
-0.70
````
-0.68
âĸ¬âĸ¬
-0.67
âĢ¢âĢ¢âĢ¢âĢ¢
-0.66
³³³
-0.66
erroneous
-0.65
POSITIVE LOGITS
oleon
1.39
olit
1.32
alm
1.31
olitan
1.28
erville
1.05
oli
0.98
rose
0.95
ixels
0.91
ole
0.89
ster
0.89
Activations Density 0.012%