INDEX
Explanations
credit-related terms, especially related to credit cards
references to credit cards and related financial terms
New Auto-Interp
Negative Logits
Bei
-0.79
gha
-0.73
hm
-0.71
tering
-0.70
zel
-0.70
cham
-0.66
theless
-0.65
Kus
-0.65
++++++++
-0.65
Siem
-0.63
POSITIVE LOGITS
card
1.25
card
1.17
worthiness
1.13
CARD
1.13
cards
1.09
cards
1.09
Cards
0.97
Card
0.97
Card
0.88
worthy
0.87
Activations Density 0.025%