Novel Parallel Algorithms for Fast Multi-GPU-Based Generation of Massive Scale-Free Networks

Alam, Mohammad Monzurul; Perumalla, Kalyan S.; Sanders, Peter

doi:10.1007/s41019-019-0088-6

Cited by 16 publications

(10 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In recent years, more paralleled deep learning methods have been brought up [11]. In the aspect of algorithms, several algorithms have been brought up to accelerate multi-GPU implementation or make the inference more accurate [1,26] and faster [7,12]. Moreover, there are researches have been done to integrate DP and MP [8].…”

Section: Multi-gpu Parallel Computingmentioning

confidence: 99%

Multi-task learning based on question–answering style reviews for aspect category classification and aspect term extraction on GPU clusters

Cheng

Wang

et al. 2020

Cluster Comput

View full text Add to dashboard Cite

Cluster computing technologies are rapidly advancing and user-generated online reviews are booming in the current Internet and e-commerce environment. The latest question–answering (Q&A)-style reviews are novel, abundant and easily digestible product reviews that also contain massive valuable information for customers. In this paper, we mine valuable aspect information of products contained in these reviews on GPU clusters. To achieve this goal, we utilize two subtasks of aspect-based sentiment analysis: aspect term extraction (ATE) and aspect category classification (ACC). Most previous works focused on only one task or solved these two tasks separately, even though they are highly interrelated, and they do not make full use of abundant training resources. To address this problem, we propose a novel multi-task neural learning model to jointly handle these two tasks and explore the performance of our model on GPU clusters. We conducted extensive comparative experiments on an annotated corpus and found that our proposed model outperforms several baseline models in ATE and ACC tasks on GPU clusters, yielding significant strides in data mining for these types of reviews.

show abstract

Section: Multi-gpu Parallel Computingmentioning

confidence: 99%

Multi-task learning based on question–answering style reviews for aspect category classification and aspect term extraction on GPU clusters

Cheng

Wang

et al. 2020

Cluster Comput

View full text Add to dashboard Cite

show abstract

“…Alam et al [12] transfer the algorithm into the distributed memory model and show how dependency chains, which are short in practice, are resolved e ciently in parallel. Alternatively if multi-edges are acceptable, [294] can be adapted to multi-GPU scenarios [11] yielding an even more scalable approach.…”

Section: Algorithmic Similarities Between Ba and Node Copymentioning

confidence: 99%

“…In that spirit, we execute EM ES on an undirected version of the crawled ClueWeb12 graph's core [324] which we obtain by deleting all nodes corresponding to uncrawled URLs. 11 Performing k = m swaps on this graph with n ≈ 9.8 • 10 8 nodes and m ≈ 3.7 • 10 10 edges is feasible in less than 19.1 h on SysB. Bhuiyan et al propose a distributed edge switching algorithm and evaluate it on a compute cluster with 64 nodes each equipped with two Intel Xeon E5-2670 2.60GHz 8-core processors and 64GB RAM [43].…”

Section: Em Esmentioning

confidence: 99%

“…In this section we evaluate the quality of the proposed algorithms and analyze the runtime of our C++ implementations. 11 EM CB, IM CB, EM GCB are designed as modules of NetworKit [316]; due to their superior performance, only the latter two were added to the library and are available since release 4.6. EM PGCB's implementation is developed separately and facilitates external memory data structures and algorithms of STXXL [102].…”

Section: Experimental Evaluationmentioning

confidence: 99%

“…To achieve comparability, we removed connectivity tests, xed memory management issues, and adopted the number of swaps. Further, we extended counters for edge ids and accumulated degrees to 64 bit integers in order to support experiments with more than 2 30 edges 11. We consider such vertices atypically simple as they have degree 1 and account for ≈84 % of nodes.…”

mentioning

confidence: 99%

See 2 more Smart Citations

Scalable generation of random graphs

Penschuck¹

View full text Add to dashboard Cite

Netzwerkmodelle spielen in verschiedenen Wissenschaftsdisziplinen eine wichtige Rolle und dienen unter anderem der Beschreibung realistischer Graphen. Sie werden häufig als Zufallsgraphen formuliert und stellen somit Wahrscheinlichkeitsverteilungen über Graphen dar. Meist ist die Verteilung dabei parametrisiert und ergibt sich implizit, etwa über eine randomisierten Konstruktionsvorschrift. Ein früher Vertreter ist das G(n,p) Modell, welches über allen ungerichteten Graphen mit n Knoten definiert ist und jede Kante unabhängig mit Wahrscheinlichkeit p erzeugt. Ein aus G(n,p) gezogener Graph hat jedoch kaum strukturelle Ähnlichkeiten zu Graphen, die zumeist in Anwendungen beobachtet werden. Daher sind populäre Modelle so gestaltet, dass sie mit hinreichend hoher Wahrscheinlichkeit gewünschte topologische Eigenschaften erzeugen. Beispielsweise ist es ein gängiges Ziel die nur unscharf definierte Klasse der sogenannten komplexen Netzwerke nachzubilden, der etwa viele soziale Netze zugeordnet werden. Unter anderem verfügen diese Graphen in der Regel über eine Gradverteilung mit schweren Rändern (heavy-tailed), einen kleinen Durchmesser, eine dominierende Zusammenhangskomponente, sowie über überdurchschnittlich dichte Teilbereiche, sogenannte Communities. Die Einsatzmöglichkeiten von Netzwerkmodellen gehen dabei weit über das ursprüngliche Ziel, beobachtete Effekte zu erklären, hinaus. Ein gängiger Anwendungsfall besteht darin, Daten systematisch zu produzieren. Solche Daten ermöglichen oder unterstützen experimentelle Untersuchungen, etwa zur empirischen Verifikation theoretischer Vorhersagen oder zur allgemeinen Bewertung von Algorithmen und Datenstrukturen. Hierbei ergeben sich insbesondere für große Probleminstanzen Vorteile gegenüber beobachteten Netzen. So sind massive Eingaben, die auf echten Daten beruhen, oft nicht in ausreichender Menge verfügbar, nur aufwendig zu beschaffen und zu verwalten, unterliegen rechtlichen Beschränkungen, oder sind von unklarer Qualität. In der vorliegenden Arbeit betrachten wir daher algorithmische Aspekte der Generierung massiver Zufallsgraphen. Um Anwendern Reproduzierbarkeit mit vorhandenen Studien zu ermöglichen, fokussieren wir uns hierbei zumeist auf getreue Implementierungen etablierter Netzwerkmodelle, etwa Preferential Attachment-Prozesse, LFR, simple Graphen mit vorgeschriebenen Gradsequenzen, oder Graphen mit hyperbolischer (o.Ä.) Einbettung. Zu diesem Zweck entwickeln wir praktisch sowie analytisch effiziente Generatoren. Unsere Algorithmen sind dabei jeweils auf ein geeignetes Maschinenmodell hin optimiert. Hierzu entwerfen wir etwa klassische sequentielle Generatoren für Registermaschinen, Algorithmen für das External Memory Model, und parallele Ansätze für verteilte oder Shared Memory-Maschinen auf CPUs, GPUs, und anderen Rechenbeschleunigern.

show abstract

GHSH: Dynamic Hyperspace Hashing on GPU

Ren

et al. 2020

Web and Big Data

View full text Add to dashboard Cite

Novel Parallel Algorithms for Fast Multi-GPU-Based Generation of Massive Scale-Free Networks

Cited by 16 publications

References 21 publications

Multi-task learning based on question–answering style reviews for aspect category classification and aspect term extraction on GPU clusters

Multi-task learning based on question–answering style reviews for aspect category classification and aspect term extraction on GPU clusters

Scalable generation of random graphs

GHSH: Dynamic Hyperspace Hashing on GPU

Contact Info

Product

Resources

About