sast-ai-workflow

Overview

SAST-AI-Workflow is a LLM-based tool designed to detect and flag suspected vulnerabilities. It inspects suspicious lines of code in a given repository and deeply review legitimacy of the error. Workflow is capable of integrating SAST reports, source code analysis, CVE data and other known examples.

SAST-AI-Workflow can be incorporated to help and provide insights in the vulnerability detection precess. As an instance, in this project we demonstrate SAST scanning of RHEL systemd (source: systemd GitHub) project.

Architecture

Key components:

Input Sources

SAST HTML Reports:
Processes scan results from SAST HTML reports.
Source Code Repository:
Source code is obtained from the systemd-rhel9 repository. It recursively scans the src directory for .c and .h files and converts all detected source files into embeddings.
CVE Information:
Embeds additional CVE data extracted from HTML pages to enrich the context used for the vulnerability analysis.
Known Examples:
Incorporates known cases for better results.

Embeddings & Vector Store

Converts the source data into embeddings using a local HuggingFace model (all-mpnet-base-v2) and stores them in a FAISS vector store.

Language Model Integration

Uses NVIDIA's API via the ChatNVIDIA integration to query the vector store and generate analysis responses.

Evaluation

Applies metrics (from Ragas library) to assess the quality of model outputs.

A detailed architecture diagram is provided in the diagrams/ folder (e.g., diagrams/architecture.png).

Evaluation & Metrics

The evaluations of the model responses are being done using the following metrics:

Response Relevancy:
Ensures that the generated answers are directly related to the query.
Response Relevancy.

Installation & Setup

1. Clone the Repository

git clone git@github.com:RHEcosystemAppEng/sast-ai-workflow.git

2. Download Secret Configuration Files

Retrieve the secret configuration files from the project’s Google Drive and place them in the appropriate location.

3. Optional - Use Existing FAISS Index

If you prefer not to generate embeddings for the source code files, download the index.faiss file from the drive and place it under the appropriate folder (e.g., the src folder).

4. Install Dependencies

Install the required dependencies:

pip install -r requirements.txt

5. Configure Environment Variables

Create a .env file (or use the existing one in the drive and place it) in the root directory and set the following:

NVIDIA_URL=<your_nvidia_url>
NVIDIA_API_KEY=<your_nvidia_api_key>
NVIDIA_LLM_MODEL_NAME=<your_nvidia_llm_model_name>
NVIDIA_EMBEDDINGS_LLM_MODEL_NAME=<your_nvidia_embedding_model_name>

6. Install the Embedding Model

Download the embedding model locally:

git clone https://huggingface.co/sentence-transformers/all-mpnet-base-v2

Alternatively, if you are using the OpenShift cluster, follow the provided cluster-specific instructions.

Usage

Run the main workflow by executing:

python run.py

This command will:

Process the SAST report -> Generate embeddings from the input sources -> Query the language model to analyze the vulnerabilities -> Evaluate the response using the defined metrics -> Export the final summary to an Excel file.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
diagrams		diagrams
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

sast-ai-workflow

Overview

Architecture

Input Sources

Embeddings & Vector Store

Language Model Integration

Evaluation

Evaluation & Metrics

Installation & Setup

1. Clone the Repository

2. Download Secret Configuration Files

3. Optional - Use Existing FAISS Index

4. Install Dependencies

5. Configure Environment Variables

6. Install the Embedding Model

Usage

About

Releases

Packages

Contributors 2

Languages

License

RHEcosystemAppEng/sast-ai-workflow

Folders and files

Latest commit

History

Repository files navigation

sast-ai-workflow

Overview

Architecture

Input Sources

Embeddings & Vector Store

Language Model Integration

Evaluation

Evaluation & Metrics

Installation & Setup

1. Clone the Repository

2. Download Secret Configuration Files

3. Optional - Use Existing FAISS Index

4. Install Dependencies

5. Configure Environment Variables

6. Install the Embedding Model

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages