INDEX
Explanations
words and phrases expressing degrees or measures of quantity and intensity
New Auto-Interp
Negative Logits
viation
-0.47
ấu
-0.46
bihan
-0.46
sponsored
-0.45
⎪
-0.45
tation
-0.45
trie
-0.45
tiek
-0.44
addItem
-0.44
andescent
-0.44
POSITIVE LOGITS
to
1.54
to
1.21
TO
1.19
To
1.11
To
1.09
לה
1.05
להת
0.93
να
0.90
לס
0.89
TO
0.89
Activations Density 0.174%