INDEX
Explanations
assertive statements of fact or truth
Negative Logits
prefixer         -0.44
stdc             -0.41
ributors         -0.41
JpaRepository    -0.41
tuch             -0.40
defaultstate     -0.40
araan            -0.40
mous             -0.39
astan            -0.39
MTG              -0.39
Positive Logits
False            0.59
True             0.56
True             0.56
believers        0.56
FALSE            0.54
False            0.54
false            0.51
believer         0.50
FALSE            0.47
true             0.47
Activations Density: 0.105%