INDEX
Explanations
specific examples
instances where "for example" is used to introduce illustrative cases or explanations
New Auto-Interp
Negative Logits
ressed
-0.68
sil
-0.66
emate
-0.66
ormal
-0.65
parliamentary
-0.65
ements
-0.65
ELY
-0.64
rive
-0.64
vell
-0.62
ogun
-0.62
POSITIVE LOGITS
Takeru
0.68
Jenkins
0.67
Schn
0.65
=#
0.64
owing
0.64
iHUD
0.63
lihood
0.63
ãĥīãĥ©
0.63
Æ
0.63
©¶æ¥µ
0.63
Activations Density 0.020%