INDEX
Explanations
the word "Of" at a specific position in the text
the presence of the word "Of"
New Auto-Interp
Negative Logits
reperto
-0.74
è¦ļéĨĴ
-0.71
cutter
-0.68
sew
-0.67
2200
-0.64
ocene
-0.63
partName
-0.61
corners
-0.58
istg
-0.58
DAQ
-0.57
POSITIVE LOGITS
course
1.16
ortunately
0.87
elia
0.87
course
0.87
onso
0.85
rontal
0.84
sted
0.84
Lies
0.84
una
0.77
Course
0.76
Activations Density 0.054%