INDEX
Explanations
words related to legal and political matters
a specific symbol or character in the text, suggesting it looks for the presence of special characters or symbols
New Auto-Interp
Negative Logits
clitor
-0.82
targeted
-0.74
spir
-0.72
pigeon
-0.71
indo
-0.71
bounty
-0.70
couch
-0.70
reflex
-0.69
appropri
-0.69
barg
-0.69
POSITIVE LOGITS
ï¸ı
1.40
ï¸
1.02
ternity
0.96
Reason
0.88
\/\/
0.87
uthor
0.86
Previously
0.86
Discuss
0.84
Wr
0.84
Roberts
0.84
Activations Density 0.156%