INDEX
Explanations
elements related to animals or pets
Negative Logits
eſt            -0.71
}],            -0.68
]-->           -0.68
Statics        -0.66
MenuView       -0.66
ſever          -0.65
#              -0.65
Ведь           -0.64
Ведь           -0.64
referenties    -0.64
Positive Logits
fucking        0.99
FUCKING        0.88
fuck           0.84
stupid         0.84
goddamn        0.83
stupidly       0.81
fuck           0.81
fucking        0.80
shitty         0.80
dumbass        0.80
Activations Density: 0.855%