Paper Abstract and Keywords |
Presentation |
2014-11-14 10:15
C2CU : A CUDA C Program Generator for Bulk Execution of a Sequential Algorithm Daisuke Takafuji, Koji Nakano, Yasuaki Ito (Hiroshima Univ.) CPSY2014-67 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
A sequential algorithm is oblivious if an address accessed at each time does not depend on input data. Many important tasks including matrix computation, signal processing, sorting, dynamic programming, and encryption/decryption can be performed by oblivious sequential algorithms. Bulk execution of a sequential algorithm is to execute it for many independent inputs in turn or in parallel. The main contribution of this paper is to develop a tool that generates a CUDA C program for the bulk execution of an oblivious sequential algorithm. More specifically, our tool automatically converts a C language program describing an oblivious sequential algorithm into a CUDA C program that performs the bulk execution of the C language program. Generated C programs can be executed in CUDA-enabled GPUs. We have implemented CUDA C programs for the bulk execution of bitonic sorting algorithm, Floyd-Warshall algorithm, and Montgomery modulo multiplication. Our implementations running on GeForce GTX Titan for the bulk execution can be 199 times faster for bitonic sort, 54 times faster for Floyd-Warshall algorithm, and 78 times faster for Montgomery modulo multiplication, over the implementations on a single Intel Xeon CPU. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
GPGPU / CUDA / oblivious algorithms / Floyd-Warshall algorithm / Montgomery modulo multiplication / / / |
Reference Info. |
IEICE Tech. Rep., vol. 114, no. 302, CPSY2014-67, pp. 75-80, Nov. 2014. |
Paper # |
CPSY2014-67 |
Date of Issue |
2014-11-06 (CPSY) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
CPSY2014-67 |
Conference Information |
Committee |
CPSY |
Conference Date |
2014-11-13 - 2014-11-14 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Hiroshima University |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Cloud Computing, etc. |
Paper Information |
Registration To |
CPSY |
Conference Code |
2014-11-CPSY |
Language |
English |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
C2CU : A CUDA C Program Generator for Bulk Execution of a Sequential Algorithm |
Sub Title (in English) |
|
Keyword(1) |
GPGPU |
Keyword(2) |
CUDA |
Keyword(3) |
oblivious algorithms |
Keyword(4) |
Floyd-Warshall algorithm |
Keyword(5) |
Montgomery modulo multiplication |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Daisuke Takafuji |
1st Author's Affiliation |
Hiroshima University (Hiroshima Univ.) |
2nd Author's Name |
Koji Nakano |
2nd Author's Affiliation |
Hiroshima University (Hiroshima Univ.) |
3rd Author's Name |
Yasuaki Ito |
3rd Author's Affiliation |
Hiroshima University (Hiroshima Univ.) |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2014-11-14 10:15:00 |
Presentation Time |
25 minutes |
Registration for |
CPSY |
Paper # |
CPSY2014-67 |
Volume (vol) |
vol.114 |
Number (no) |
no.302 |
Page |
pp.75-80 |
#Pages |
6 |
Date of Issue |
2014-11-06 (CPSY) |
|