INDEX
Explanations
numerical or measurement-related terms
phrases that approximate quantities or measurements
Negative Logits
oli        -0.68
ves        -0.64
ieu        -0.63
rive       -0.62
alez       -0.62
Express    -0.62
iments     -0.62
oft        -0.61
ousel      -0.61
oa         -0.60
Positive Logits
analogous    0.96
equivalent   0.83
WATCHED      0.81
Ĥİ           0.80
contempor    0.79
Ń·           0.76
820          0.76
speaking     0.75
midway       0.75
800          0.74
Activations Density 0.033%