INDEX
Explanations
phrases and concepts related to educational content and guidance
New Auto-Interp
Negative Logits
isser
-0.17
Paid
-0.15
iei
-0.15
mute
-0.15
Applied
-0.15
aran
-0.14
esi
-0.14
_paid
-0.14
Called
-0.14
rine
-0.14
POSITIVE LOGITS
contained
0.68
contained
0.59
included
0.55
included
0.48
Contained
0.45
featured
0.42
Included
0.42
INCLUDED
0.40
Included
0.39
-contained
0.38
Activations Density 0.307%