
boomerAMG on GPU for SolidMechanicsLagrangianSSLE #1054

Merged: 68 commits merged into develop from feature/boomerAMG-for-elasticity on Mar 9, 2021

Conversation

@castelletto1 (Contributor) commented on Jul 16, 2020:

The purpose of this PR is to enable boomerAMG preconditioning on GPU for an elasticity problem. The linear algebra interface is set to hypre by default. As a model problem, the following simple cantilevered cube problem has been added:

SSLE-QS-cantileveredCube.xml

The domain is a unit cube discretized with a regular 10x10x10 Cartesian mesh.

The related LvArray PR:
GEOS-DEV/LvArray#213

The Hypre build options are in the corresponding thirdPartyLibs PR:
GEOS-DEV/thirdPartyLibs#132

@castelletto1 castelletto1 marked this pull request as draft July 16, 2020 01:26
@castelletto1 castelletto1 changed the title Optimizing boomerAMG parameters for SolidMechanicsLagrangianSSLE boomerAMG on GPU for SolidMechanicsLagrangianSSLE Aug 18, 2020
@oseikuffuor1 (Contributor) commented:
All, here are some recommendations for building and running with hypre on the GPU.
Configure Options:

  1. --with-cuda
  2. --enable-cusparse (optional; this allows one to use cusparse for certain operations and can be ignored for now)
  3. --enable-debug (helpful for debugging issues down the road; can be omitted once the integration is stable)
  4. Set the environment variable HYPRE_CUDA_SM to match the SM (streaming multiprocessor) architecture of the hardware. The default is 60; change it to 70 if running on Lassen (V100 systems), for example.
  5. See https://hypre.readthedocs.io/en/latest/ch-misc.html for more details about building hypre with GPU support.

Enabling GPU support

  1. In user code, first call HYPRE_Init(); to initialize the GPU libraries prior to calling any other hypre functions.
  2. Set the execution policy to device:
     a. Set the HYPRE_ExecutionPolicy variable to device: HYPRE_ExecutionPolicy default_exec_policy = HYPRE_EXEC_DEVICE;
     b. Set the default policy handle: hypre_HandleDefaultExecPolicy(hypre_handle()) = default_exec_policy; (this tells hypre to do the AMG setup on device; steps 2a and 2b can be combined by setting the handle directly to HYPRE_EXEC_DEVICE)
  3. Call HYPRE_Finalize(); at the end (before MPI_Finalize is called).
  4. See ij.c or ij_assembly.c in hypre's test directory for additional insight. A minimal initialization sketch is given after this list.
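
A minimal sketch of that initialization sequence, assuming a CUDA-enabled hypre build; the hypre_handle()/hypre_HandleDefaultExecPolicy accessors are internal hypre API and come from _hypre_utilities.h:

```c++
#include <mpi.h>
#include "HYPRE_utilities.h"
#include "_hypre_utilities.h"  // internal header for hypre_handle()/hypre_HandleDefaultExecPolicy

int main( int argc, char ** argv )
{
  MPI_Init( &argc, &argv );

  // Step 1: initialize hypre (and its GPU libraries) before any other hypre call.
  HYPRE_Init();

  // Step 2: tell hypre to run the AMG setup/solve on the device.
  HYPRE_ExecutionPolicy default_exec_policy = HYPRE_EXEC_DEVICE;
  hypre_HandleDefaultExecPolicy( hypre_handle() ) = default_exec_policy;

  // ... assemble the linear system and set up/apply BoomerAMG here ...

  // Step 3: finalize hypre before MPI_Finalize.
  HYPRE_Finalize();
  MPI_Finalize();
  return 0;
}
```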

Runtime solver options:

  • Relaxation options: use option 18 or 7.
  • Coarsening options: only PMIS is supported.
  • Interpolation options: use option 3, 6, 14 or 15.

Once everything is up and running, there may be other options to tune for performance; I have omitted them for now. A sketch of setting these options is given below.
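
A rough sketch of how these recommendations map onto the BoomerAMG C API, assuming the standard HYPRE_BoomerAMGSet* setters; the particular picks (relax type 18, interpolation type 6) are just examples from the lists above, and coarsening type 8 is hypre's identifier for PMIS:

```c++
#include "HYPRE_parcsr_ls.h"

// Illustrative helper: apply the GPU-friendly BoomerAMG options listed above.
void setBoomerAMGGpuOptions( HYPRE_Solver solver )
{
  HYPRE_BoomerAMGSetRelaxType( solver, 18 );   // relaxation: 18 (or 7)
  HYPRE_BoomerAMGSetCoarsenType( solver, 8 );  // coarsening: 8 = PMIS (the only GPU-supported option)
  HYPRE_BoomerAMGSetInterpType( solver, 6 );   // interpolation: 3, 6, 14 or 15
}
```

Such a helper would be called after HYPRE_BoomerAMGCreate( &solver ) and before the setup/solve.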

There's one minor edit from our discussion yesterday. I believe there were two scenarios:

  1. Assemble linear system on host and pass to hypre
  2. Assemble linear system on device and pass to hypre

Since GEOSX does not rely on unified memory, I am leaning towards option 2, as long as the data given to hypre is consistent ParCSR matrix data. I had mentioned that passing hypre host data should work, but this is only true with unified memory. Without unified memory, option 1 could be realized by moving the assembled matrix to the device and then calling hypre. If you have questions about setting up the linear system matrix on the device, let me know and we can chat again. Of course, also let me know if you have additional questions about these notes.
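
A rough sketch of option 2 through hypre's IJ interface, assuming a CUDA-enabled build with the _v2 initialization that takes a memory location; assembleOnDevice and the nnzPerRow/rows/cols/vals device arrays are hypothetical stand-ins for whatever the GEOSX assembly produces on the device:

```c++
#include <mpi.h>
#include "HYPRE.h"
#include "HYPRE_utilities.h"
#include "HYPRE_IJ_mv.h"
#include "HYPRE_parcsr_mv.h"

// Hypothetical example: assemble an IJ matrix from device-resident arrays and
// hand the underlying ParCSR object to hypre.
HYPRE_ParCSRMatrix assembleOnDevice( MPI_Comm comm,
                                     HYPRE_BigInt ilower, HYPRE_BigInt iupper,
                                     HYPRE_Int * nnzPerRow,  // device pointer
                                     HYPRE_BigInt * rows,    // device pointer
                                     HYPRE_BigInt * cols,    // device pointer
                                     HYPRE_Real * vals )     // device pointer
{
  HYPRE_IJMatrix ij;
  HYPRE_IJMatrixCreate( comm, ilower, iupper, ilower, iupper, &ij );
  HYPRE_IJMatrixSetObjectType( ij, HYPRE_PARCSR );

  // Initialize the matrix with device memory so that hypre accepts device pointers.
  HYPRE_IJMatrixInitialize_v2( ij, HYPRE_MEMORY_DEVICE );

  // Host-side calls, but the row/column/value arrays live on the device.
  HYPRE_IJMatrixSetValues( ij, static_cast< HYPRE_Int >( iupper - ilower + 1 ),
                           nnzPerRow, rows, cols, vals );
  HYPRE_IJMatrixAssemble( ij );

  HYPRE_ParCSRMatrix parcsr;
  HYPRE_IJMatrixGetObject( ij, (void **) &parcsr );
  return parcsr;
}
```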

@andrea-franceschini (Contributor) commented:

Does hypre provide a block Jacobi preconditioner?

[attached image: blkdiag]

@oseikuffuor1 (Contributor) commented:

> Does hypre provide a block Jacobi preconditioner?
>
> [attached image: blkdiag]

@AF1990 are these blocks per processor or per unknown? We do not have a BJ preconditioner for the unknown-based version, but for the per-processor blocks we have the BJ ILU preconditioner.

@andrea-franceschini (Contributor) commented:

> @AF1990 are these blocks per processor or per unknown? We do not have a BJ preconditioner for the unknown-based version, but for the per-processor blocks we have the BJ ILU preconditioner.

I am interested in the unknown-based version of the BJ preconditioner. I know that the per-processor version is already available.

@rrsettgast rrsettgast force-pushed the feature/boomerAMG-for-elasticity branch from d194411 to 16ffd62 on October 1, 2020 06:50
@rrsettgast (Member) left a comment:

@oseikuffuor1 I think this is set up per your instructions, but execution on Lassen using nvprof doesn't show any CUDA kernels, so it must still be executing on the host. Any suggestions?

@@ -26,11 +26,13 @@ namespace geosx

// Check matching requirements on index/value types between GEOSX and SuperLU_Dist

#if !defined(GEOSX_USE_HYPRE_CUDA)
Member comment:
@corbett5 @klevzoff I won't be addressing this since hypre will be fixing their global index types soon.

@rrsettgast rrsettgast requested review from corbett5 and klevzoff March 8, 2021 07:39
CRSMatrix< real64 > tempMat;
tempMat.resize( localRows, src.numGlobalCols(), maxDstEntries );

for( globalIndex r=0; r<localRows; ++r )
Contributor comment:

Did you mean for this to be a parallel kernel launch? Or what's the purpose of using 2D arrays for srcIndices and srcValues?

Member reply:

There are a bunch of host functions in this for loop. They come from hypre, so we don't have control over them. It is pretty strange: the underlying hypre functions are all host functions, but when running on device they take device pointers.

/// Enables use of PETSc library (CMake option ENABLE_PETSC)
#define GEOSX_USE_PETSC

/// Choice of global linear algebra interface (CMake option GEOSX_LA_INTERFACE)
#define GEOSX_LA_INTERFACE Hypre
#define GEOSX_LA_INTERFACE Trilinos
Contributor comment:

Reverted back to Trilinos, intentional?

(We should look into changing the way this file is handled... it's getting out of hand, especially with more macros added.)

Member reply:

This seems to be a general issue. I must have built on Lassen and accidentally committed this file.

@rrsettgast rrsettgast added the "ci: run CUDA builds" label on Mar 8, 2021
@rrsettgast rrsettgast merged commit 3c24607 into develop Mar 9, 2021
@rrsettgast rrsettgast deleted the feature/boomerAMG-for-elasticity branch March 9, 2021 23:30