INDEX
Explanations
individual nouns or noun phrases that contain the word "of"
the preposition "of"
Negative Logits
partName     -0.76
CBI          -0.62
ss           -0.61
cf           -0.61
prices       -0.61
barg         -0.60
passers      -0.60
galleries    -0.59
scores       -0.59
appra        -0.59
Positive Logits
rontal       1.33
sky          1.00
uture        0.96
eatures      0.95
ortunately   0.93
unction      0.92
icial        0.92
rost         0.88
rame         0.87
course       0.87
Activations Density 0.027%