INDEX
Explanations
phrases indicating importance or significance
phrases that indicate singular identification or significance
New Auto-Interp
Negative Logits
ooks
-0.66
cores
-0.64
ãĤµ
-0.62
Palestin
-0.60
fences
-0.59
inas
-0.58
older
-0.58
warranties
-0.58
cliffs
-0.58
inity
-0.58
POSITIVE LOGITS
Hundred
0.91
hundred
0.89
dimensional
0.86
rency
0.81
Thousand
0.75
thing
0.74
Piece
0.73
anchester
0.73
eree
0.72
thousand
0.72
Activations Density 0.149%