INDEX
Explanations
strongly emphasized phrases that convey absoluteness or completeness
New Auto-Interp
Negative Logits
Datuak
-0.74
Plated
-0.69
רבה
-0.69
Kontakte
-0.68
nameLabel
-0.68
רבים
-0.67
Lieferumfang
-0.66
Recife
-0.65
Constellation
-0.65
Miquel
-0.65
POSITIVE LOGITS
totally
1.46
completely
1.45
Completely
1.43
Totally
1.40
Totally
1.39
Completely
1.35
totally
1.26
entirely
1.24
completely
1.21
utterly
1.11
Activations Density 0.126%