Sign in
Author

Conference

Journal

Organization

Year

DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all fields of study
Limit my searches in the following fields of study
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(6)
Instruction Scheduling
Matrix Function
Memory Access
Optimization Technique
Scientific Computing
Basic Linear Algebra Subprograms
Subscribe
Academic
Publications
Optimization of Triangular Matrix Functions in BLAS Library on Loongson2F
Optimization of Triangular Matrix Functions in BLAS Library on Loongson2F,10.1007/9783642156724_5,Yun Xu,Mingzhi Shao,Da Teng
Edit
Optimization of Triangular Matrix Functions in BLAS Library on Loongson2F
BibTex

RIS

RefWorks
Download
Yun Xu
,
Mingzhi Shao
,
Da Teng
BLAS (Basic
Linear Algebra
Subprograms) plays a very important role in
scientific computing
and engineering applications. ATLAS is often recommended as a way to generate an optimized BLAS library. Based on ATLAS, this paper optimizes the algorithms of triangular matrix functions on 750 MHZ Loongson 2F processorspecific architecture. Using loop unrolling,
instruction scheduling
and data prefetching techniques, computing time and
memory access
delay are both reduced, and thus the performance of functions is improved. Experimental results indicate that these optimization techniques can effectively reduce the running time of functions. After optimization, doubleprecision type function of TRSM has the speed of 1300Mflops, while singleprecision type function has the speed of 1800Mflops. Compared with ATLAS, the performance of function TRSM is improved by 50% to 60%, even by 100% to 200% under smallscale input.
Conference:
Network and Parallel Computing  NPC
, pp. 3545, 2010
DOI:
10.1007/9783642156724_5
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
www.springerlink.com
)
(
www.springerlink.com
)
(
dx.doi.org
)
(
www.informatik.unitrier.de
)
More »
References
(8)
Basic Linear Algebra Subprograms for Fortran Usage
(
Citations: 682
)
C. L. Lawson
,
Richard J. Hanson
,
D. R. Kincaid
,
Fred T. Krogh
Journal:
ACM Transactions on Mathematical Software  TOMS
, vol. 5, no. 3, pp. 308323, 1979
Finite field linear algebra subroutines
(
Citations: 39
)
JeanGuillaume Dumas
,
Thierry Gautier
,
Clément Pernet
Conference:
International Symposium on Symbolic and Algebraic Computation  ISSAC
, pp. 6374, 2002
Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software
(
Citations: 83
)
Erik Elmroth
,
Fred Gustavson
,
Isak Jonsson
,
Bo Kagstrom
Journal:
Siam Review  SIAM REV
, vol. 46, no. 1, pp. 345, 2004
The cacheoblivious gaussian elimination paradigm: theoretical framework, parallelization and experimental evaluation
(
Citations: 14
)
Rezaul Alam Chowdhury
,
Vijaya Ramachandran
Conference:
ACM Symposium on Parallel Algorithms and Architectures  SPAA
, pp. 7180, 2007
Automated Empirical Optimization of Software and the ATLAS Project
(
Citations: 431
)
Unknown
Published in 2000.