Explanations
Biblical references and phrases
references to religious or scriptural citations
Negative Logits
Token       Logit
ortunately  -0.68
stamp       -0.68
tack        -0.66
behavi      -0.66
popcorn     -0.66
clos        -0.65
consolid    -0.65
tradem      -0.64
pudding     -0.62
plush       -0.62
Positive Logits
Token  Logit
1      1.37
6      1.29
5      1.28
8      1.27
7      1.27
4      1.26
3      1.26
9      1.26
31     1.24
11     1.23
Activations Density 0.022%