INDEX
Explanations
unique characters or symbols like "â̦" that are not typically found in regular text
punctuation marks or special characters
New Auto-Interp
Negative Logits
odium
-0.72
onds
-0.72
reper
-0.72
iculture
-0.70
ppings
-0.69
iciency
-0.68
ppers
-0.67
acters
-0.66
artz
-0.66
grass
-0.66
POSITIVE LOGITS
â̦â̦â̦â̦â̦â̦â̦â̦
1.06
â̦â̦â̦â̦
1.01
BUT
0.87
wait
0.85
Continue
0.80
there
0.75
etc
0.73
was
0.71
had
0.71
////////////////////////////////
0.70
Activations Density 0.020%