How is the underlying embedding matrix updated? #1338

SDUxmh · 2023-06-09T06:46:52Z

I realized the fastText code at the underlying , and adopted the gradient ascending method when calculating gradient, which could be completed by training small sample data. However, when training large sample data, update the embedding matrix and add lr*grad to each word vector. After several epoches, the embedding matrix will explode directly (nan). We want to know how the underlying embedding matrix is updated.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How is the underlying embedding matrix updated? #1338

How is the underlying embedding matrix updated? #1338

SDUxmh commented Jun 9, 2023 •

edited

How is the underlying embedding matrix updated? #1338

How is the underlying embedding matrix updated? #1338

Comments

SDUxmh commented Jun 9, 2023 • edited

SDUxmh commented Jun 9, 2023 •

edited