INDEX
Explanations
instances of the word 'based' in various contexts
New Auto-Interp
Negative Logits
idon
-0.79
rompt
-0.69
oint
-0.62
complication
-0.61
confidentiality
-0.61
payday
-0.61
Upload
-0.61
duplication
-0.61
Loan
-0.59
referral
-0.59
POSITIVE LOGITS
atorium
0.77
loosely
0.76
カ
0.72
eful
0.72
pha
0.70
ズ
0.66
encies
0.65
squarely
0.63
elled
0.63
SPONSORED
0.63
Activations Density 0.029%