INDEX
Explanations
phrases related to academic and legal contexts
Negative Logits
ersed         -0.74
otin          -0.72
bender        -0.69
witch         -0.69
escription    -0.68
trak          -0.68
nesty         -0.68
urations      -0.66
aphael        -0.66
verb          -0.65
Positive Logits
own           1.75
OWN           1.02
liking        1.01
repertoire    1.00
respective    0.99
Own           0.99
arsenal       0.98
fingertips    0.94
(blank)       0.87
portfolio     0.86
Activations Density 0.177%