INDEX
Explanations
references to specific individuals or events
New Auto-Interp
Negative Logits
itſelf
-0.83
humaine
-0.76
médicale
-0.67
umana
-0.65
feroit
-0.64
ſelf
-0.64
pixabay
-0.63
BoxShadow
-0.63
militaires
-0.62
spirituale
-0.62
POSITIVE LOGITS
kasarigan
0.80
bronn
0.61
clean
0.59
@
0.57
<?
0.57
krim
0.56
setzer
0.54
nice
0.53
dialed
0.52
thiệu
0.52
Activations Density 0.345%