INDEX
Explanations
specific symbols and characters in a text
occurrences of the word "forbid" and its variations, alongside specific symbols and abbreviations
New Auto-Interp
Negative Logits
uations
-0.75
urally
-0.74
uation
-0.73
orem
-0.72
eur
-0.71
emouth
-0.70
heed
-0.68
owan
-0.66
ciating
-0.65
ements
-0.65
POSITIVE LOGITS
ļéĨĴ
0.93
bie
0.87
atri
0.87
earance
0.87
stract
0.84
é¾įåĸļ士
0.74
bian
0.74
icity
0.72
edia
0.71
Reply
0.71
Activations Density 0.051%