INDEX
Explanations
specific names or terms related to various contexts, such as individuals or organizations
New Auto-Interp
Negative Logits
Maher
-0.79
Letter
-0.68
Daughter
-0.67
TIME
-0.66
Principle
-0.65
Winn
-0.65
Messenger
-0.63
messenger
-0.63
Notice
-0.63
tug
-0.63
POSITIVE LOGITS
eca
1.02
ic
1.01
ac
1.01
ico
1.00
isc
1.00
esc
0.99
ib
0.98
ec
0.98
ibus
0.94
icro
0.94
Activations Density 0.288%