INDEX
Explanations
mentions of parts, shares, or quantities of something
references to fractions or segments of items or concepts
New Auto-Interp
Negative Logits
Fighters
-0.71
Trend
-0.68
generic
-0.68
verbs
-0.67
ATHER
-0.65
raid
-0.63
lear
-0.61
æ©
-0.59
Iran
-0.59
friends
-0.59
POSITIVE LOGITS
thereof
1.05
meal
0.79
portions
0.77
edly
0.76
ials
0.76
ILCS
0.73
itol
0.73
xual
0.72
icularly
0.71
xit
0.70
Activations Density 0.013%