INDEX
Explanations
explaining or presenting information
Negative Logits
undergo     0.78
absorb      0.74
populate    0.72
absorbs     0.71
absor       0.70
吸収        0.69
ABSOR       0.68
Consume     0.66
Receive     0.65
WOW         0.65
Positive Logits
menyatakan  1.17
強調        1.15
подчер      1.13
enfat       1.11
argument    1.07
menyebut    1.06
menjelaskan 1.04
强调        1.03
sottoline   1.03
指出        1.02
Activations Density: 0.094%