INDEX
Explanations
terms related to religious or doctrinal teachings
Negative Logits
Musical    -0.50
envies     -0.49
Grüsse     -0.48
Busy       -0.47
Photo      -0.47
Miss       -0.46
Mang       -0.46
Merry      -0.45
Kevin      -0.45
Andy       -0.45
Positive Logits
doctrine       0.89
Doctrine       0.88
doctrines      0.78
doctrine       0.66
Doctrine       0.64
doctrina       0.63
tenet          0.58
AnchorStyles   0.56
doctr          0.54
getRule        0.54
Activations Density: 0.010%