Implement much faster dot product algorithm for tensors #460
No reviewers
Labels
No Label
bug
dependencies
discussion
documentation
duplicate
feature
good first issue
misc
performance
question
test
use case
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: kscience/kmath#460
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "kylchik/fast-dot"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
It is very simple optimization. Using async profiler I found that get method for MutableStructure is bottle neck because of excess work with array index. So I replaced it with direct access to buffer.
Before optimization:
With optimization:
Total increase in performance is around 100x...
Correctness of dot product was checked using space.kscience.kmath.tensors.core.TestDoubleTensorAlgebra#testDot test.