# Tooling and Ecosystem Standards for LLVM

This document outlines the recommended tooling and ecosystem standards for LLVM development. Adhering to these standards ensures a consistent, maintainable, and performant codebase across the LLVM project. These guidelines aim to improve developer productivity, code quality, and collaboration within the LLVM community.

## 1. Development Environment and Build System

### 1.1. Recommended Development Environment

* **Do This:** Use a standardized development environment to ensure consistency across platforms and teams. Consider using Docker containers or similar virtualization technologies to encapsulate dependencies.

* **Don't Do This:** Rely on ad-hoc setups that are difficult to reproduce and may lead to environment-specific bugs.

**Why:** Consistency reduces "works on my machine" issues and simplifies collaboration.

**Example:**

"""dockerfile

# Dockerfile for LLVM development
# Pin a base image release so the environment stays reproducible.
FROM ubuntu:22.04

# Avoid interactive prompts from apt during the image build
ARG DEBIAN_FRONTEND=noninteractive

# Install dependencies
RUN apt-get update && apt-get install -y \
    build-essential \
    cmake \
    git \
    python3 \
    ninja-build \
    clang \
    lld \
    libtinfo-dev

# Set up LLVM source directory
WORKDIR /llvm

# Clone LLVM (replace with specific branch/version if needed)
RUN git clone https://github.com/llvm/llvm-project.git

# Set environment variables
ENV LLVM_SRC=/llvm/llvm-project
ENV PATH="$PATH:/llvm/llvm-project/build/bin"

# Optional: to build clang-tools-extra, add it to LLVM_ENABLE_PROJECTS below

# Build LLVM (example)
WORKDIR /llvm/llvm-project/build
RUN cmake -G Ninja -DLLVM_ENABLE_PROJECTS="clang;lld" -DCMAKE_BUILD_TYPE=Release ../llvm
RUN ninja

"""

### 1.2. Build System - CMake

* **Do This:** Utilize the CMake build system provided by LLVM. Follow the standard CMake practices for defining targets, dependencies, and build options.

* **Don't Do This:** Resort to custom build scripts or Makefile hacks that bypass the CMake infrastructure.

**Why:** CMake provides a portable, efficient, and well-supported build system that integrates seamlessly with LLVM's tooling.

**Example:**

"""cmake

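# CMakeLists.txt for a new out-of-tree LLVM tool
cmake_minimum_required(VERSION 3.13) # Raise this to match the minimum required by your LLVM release.
project(MyNewTool)

# Find LLVM (alternatively, pass -DLLVM_DIR=... on the cmake command line)
set(LLVM_DIR /path/to/llvm/build/lib/cmake/llvm) # Replace with your LLVM build directory.
find_package(LLVM REQUIRED CONFIG)

# Make LLVM's headers and compile definitions visible to the tool
include_directories(${LLVM_INCLUDE_DIRS})
separate_arguments(LLVM_DEFINITIONS_LIST NATIVE_COMMAND ${LLVM_DEFINITIONS})
add_definitions(${LLVM_DEFINITIONS_LIST})

# Add source files
add_executable(MyNewTool MyNewTool.cpp)

# Map LLVM component names to library names, then link against them
llvm_map_components_to_libnames(LLVM_LIBS support core) # Add more components as needed.
target_link_libraries(MyNewTool ${LLVM_LIBS})

# Install the tool
install(TARGETS MyNewTool DESTINATION bin)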

"""

### 1.3. Build Types

* **Do This:** Use the standard CMake build types: "Debug", "Release", "RelWithDebInfo", and "MinSizeRel". Use "Debug" builds for development and debugging. Use "Release", "RelWithDebInfo", or "MinSizeRel" when profiling or deploying.

* **Don't Do This:** Create custom, non-standard build types without a clear justification, or mix debug and release flags manually.

**Why:** Standard build types are optimized for specific scenarios. Debug builds provide symbolic information for debugging, and release builds are optimized for performance.

### 1.4. Ninja Build System

* **Do This:** Use Ninja as a CMake generator ("cmake -G Ninja ...").

* **Don't Do This:** Rely solely on Makefiles (unless there's a specific reason).

**Why:** Ninja generally provides faster build times compared to Makefiles, especially for large projects like LLVM.

### 1.5. ccache or Similar Caching Tools

* **Do This:** Use "ccache" or "sccache" to significantly reduce compilation times, especially in CI environments.

* **Don't Do This:** Neglect to configure or utilize these tools when doing frequent builds.

**Why:** These tools improve iterative development speed by caching and reusing compilation results.

"""bash

# Example integration (assuming ccache is installed)
export CCACHE_DIR=/path/to/ccache
export CCACHE_MAXSIZE=10G

# Route both the C and C++ compilers through ccache
cmake -G Ninja -DCMAKE_C_COMPILER_LAUNCHER=ccache -DCMAKE_CXX_COMPILER_LAUNCHER=ccache ...

"""

## 2. LLVM Libraries and Tools

### 2.1. Utilizing LLVM Support Libraries

* **Do This:** Leverage LLVM's comprehensive support libraries for common tasks like string manipulation ("llvm::StringRef", "llvm::Twine"), data structures ("llvm::SmallVector", "llvm::DenseMap"), and file system access ("llvm::sys::fs").

* **Don't Do This:** Re-implement functionality already provided by LLVM's support libraries. Avoid using "std::string" where "llvm::StringRef" is more appropriate (read-only string access).

**Why:** LLVM support libraries are highly optimized and integrated within the LLVM ecosystem. They also promote code reuse and consistency.

**Example:**

"""c++

#include "llvm/ADT/StringRef.h"
#include "llvm/Support/raw_ostream.h"

// StringRef is a non-owning view, so passing it by value copies no characters.
void printMessage(llvm::StringRef Message) {
  llvm::outs() << "Message: " << Message << "\n";
}

int main() {
  const char *text = "Hello, LLVM!";
  llvm::StringRef message(text);
  printMessage(message);
  return 0;
}

"""

### 2.2. Using LLVM's Diagnostics Infrastructure

* **Do This:** Employ LLVM's diagnostic reporting mechanism ("llvm::DiagnosticInfo", "llvm::DiagnosticPrinter", "llvm::DiagnosticHandler") to issue errors, warnings, and remarks. This is especially true when developing compiler passes.

* **Don't Do This:** Use raw "fprintf" or "std::cerr" for diagnostic output, as this bypasses LLVM's structured error handling.

**Why:** LLVM's diagnostic system provides a unified way to report diagnostic information, enabling better integration with IDEs and tools.

**Example:**

"""c++

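#include "llvm/ADT/StringRef.h"
#include "llvm/IR/DiagnosticHandler.h"
#include "llvm/IR/DiagnosticInfo.h"
#include "llvm/IR/DiagnosticPrinter.h"
#include "llvm/IR/LLVMContext.h"
#include "llvm/Support/raw_ostream.h"
#include <cstdlib>
#include <memory>
#include <string>

// Custom diagnostic. The base class stores the kind and the severity;
// getSeverity() is not virtual, so the severity is passed to the base.
class MyDiagnosticInfo : public llvm::DiagnosticInfo {
public:
  MyDiagnosticInfo(llvm::StringRef Message, llvm::DiagnosticSeverity Severity)
      : llvm::DiagnosticInfo(getKindID(), Severity), Message(Message.str()) {}

  void print(llvm::DiagnosticPrinter &DP) const override {
    DP << "MyCustomTool: " << Message;
  }

private:
  // Reserve one plugin diagnostic kind for this class.
  static int getKindID() {
    static const int ID = llvm::getNextAvailablePluginDiagnosticKind();
    return ID;
  }

  std::string Message; // Owns its text so it cannot dangle.
};

// Handler that prints every diagnostic and exits on errors.
struct MyDiagnosticHandler : llvm::DiagnosticHandler {
  bool handleDiagnostics(const llvm::DiagnosticInfo &DI) override {
    llvm::DiagnosticPrinterRawOStream DP(llvm::errs());
    DI.print(DP);
    llvm::errs() << '\n';
    if (DI.getSeverity() == llvm::DS_Error)
      std::exit(1);
    return true; // The diagnostic has been handled.
  }
};

void emitDiagnostic(llvm::LLVMContext &Context, llvm::StringRef Message,
                    llvm::DiagnosticSeverity Severity) {
  Context.diagnose(MyDiagnosticInfo(Message, Severity));
}

int main() {
  llvm::LLVMContext Context;
  Context.setDiagnosticHandler(std::make_unique<MyDiagnosticHandler>());

  emitDiagnostic(Context, "This is a warning!", llvm::DS_Warning);
  emitDiagnostic(Context, "This is an error!", llvm::DS_Error); // Exits with status 1.
  return 0;
}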

"""

### 2.3. TableGen

* **Do This:** Use TableGen to describe declarative information such as instruction sets, register definitions, and code patterns. Define data in ".td" files and generate C++ code using TableGen tools.

* **Don't Do This:** Hardcode these data directly in C++. Avoid manual modifications of generated code.

**Why:** TableGen allows you to describe data in a structured, declarative way, which makes it easier to maintain and extend.

**Example:**

"""tablegen

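// Example .td file
include "llvm/Target/Target.td" // Provides the Instruction base class.

class MyInstruction<string opCodeStr, list<dag> pattern> : Instruction {
  string OpCodeStr = opCodeStr;
  list<dag> Pattern = pattern;
  let Namespace = "MY";
}

def MyADD : MyInstruction<"add", [(add i32:$src1, i32:$src2)]>;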

"""

Then, use the appropriate TableGen backend to generate C++ code based on this definition.

### 2.4. Versioning and Compatibility

* **Do This:** Follow LLVM's versioning scheme meticulously. Use compatibility macros and conditional compilation to ensure that your code can be compiled with older or newer versions of LLVM. Consult the LLVM release notes for API changes and deprecations.

* **Don't Do This:** Assume that the LLVM API will remain stable across releases.

**Why:** LLVM evolves rapidly, and maintaining compatibility is crucial for long-term project health.

**Example:**

"""c++

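#include "llvm/Config/llvm-config.h" // Defines LLVM_VERSION_MAJOR
#include "llvm/Support/raw_ostream.h"

#if LLVM_VERSION_MAJOR >= 16 // Example version check
// Include headers that exist only in newer releases here. The name below is a
// placeholder, not a real LLVM header.
// #include "llvm/NewHeader.h"
#define NEW_API_AVAILABLE 1
#else
#define NEW_API_AVAILABLE 0
#endif

void myFunction() {
#if NEW_API_AVAILABLE
  llvm::outs() << "Using the new API.\n";
#else
  llvm::outs() << "Using the old API.\n";
#endif
}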

"""

## 3. Coding Practices and Conventions

### 3.1. Code Formatting and Style

* **Do This:** Adhere strictly to the LLVM coding style guidelines. Use "clang-format" to automatically format your code. Configure your editor to run "clang-format" on save.

* **Don't Do This:** Deviate from the established coding style.

**Why:** Consistent code formatting improves readability and reduces merge conflicts.

**Example:**

To format your code, use the following command:

"""bash

clang-format -i MyFile.cpp

"""

The LLVM coding style guide can be found at: [https://llvm.org/docs/CodingStandards.html](https://llvm.org/docs/CodingStandards.html)

### 3.2. Code Reviews

* **Do This:** Submit your code for review as a GitHub pull request against "llvm/llvm-project". LLVM's code review moved from Phabricator to GitHub pull requests. Provide clear explanations of your changes and address reviewer feedback promptly and thoroughly.

* **Don't Do This:** Bypass the code review process.

**Why:** Code reviews help catch errors, improve code quality, and disseminate knowledge.

### 3.3. Testing

* **Do This:** Write comprehensive unit tests and integration tests for your code. Use LLVM's lit testing framework. Add new tests for bug fixes and new features.

* **Don't Do This:** Neglect testing or submit code with inadequate test coverage.

**Why:** Thorough testing is essential to ensure the correctness and stability of LLVM.

**Example:**

Create a "test/MyTest.ll" file:

"""llvm

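; RUN: opt -S -passes=verify < %s | FileCheck %s

; CHECK-LABEL: define i32 @main()
define i32 @main() {
; CHECK: call void @printMessage()
  call void @printMessage()
  ret i32 0
}

declare void @printMessage()

"""

Each "RUN" line is a shell command in which lit replaces "%s" with the path of the test file. Here "opt" parses, verifies, and re-prints the IR, and "FileCheck" matches that output against the "CHECK" lines. Piping the test file straight into "FileCheck" would pass trivially, because the "CHECK" text would match itself. To test your own tool, invoke it in the "RUN" line instead of "opt" and pipe its output to "FileCheck". Place the file in a test directory that has a lit configuration, hook that directory into your "CMakeLists.txt", and run "llvm-lit" from the build directory to execute the test.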

### 3.4. Documentation

* **Do This:** Document your code clearly and concisely using Doxygen-style comments. Provide high-level documentation for public APIs and data structures. Update documentation when you change the code.

* **Don't Do This:** Neglect documentation or write ambiguous or outdated documentation.

**Why:** Good documentation makes it easier for others (and your future self) to understand and maintain your code.

**Example:**

"""c++

/// Calculates the sum of two integers.
///
/// \param A The first integer.
/// \param B The second integer.
/// \returns The sum of A and B.
int add(int A, int B) {
  return A + B;
}

"""

### 3.5. Memory Management

* **Do This:** Use smart pointers ("std::unique_ptr", "std::shared_ptr") or LLVM's "BumpPtrAllocator" for memory management. Adhere to RAII principles. When using "BumpPtrAllocator", allocate memory in arenas and avoid manual "delete" calls.

* **Don't Do This:** Use raw pointers and manual "new"/"delete" without a clear understanding of ownership semantics.

**Why:** Proper memory management prevents memory leaks and dangling pointers.
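
The smart-pointer side of this guideline can be sketched in standard C++ alone. The snippet below is an illustration rather than LLVM code: "Node", "makeLeaf", "makeParent", and "sum" are hypothetical names. It shows ownership expressed in the type, transferred with "std::move", and released without any manual "delete".

```cpp
#include <memory>
#include <utility>

// Hypothetical tree node used only for this sketch.
struct Node {
  int Value;
  std::unique_ptr<Node> Left;  // Each child has exactly one owner: its parent.
  std::unique_ptr<Node> Right;
  explicit Node(int V) : Value(V) {}
};

// Returns a uniquely owned leaf node.
std::unique_ptr<Node> makeLeaf(int V) { return std::make_unique<Node>(V); }

// Takes ownership of both children and returns the owning parent.
std::unique_ptr<Node> makeParent(int V, std::unique_ptr<Node> L,
                                 std::unique_ptr<Node> R) {
  auto Parent = std::make_unique<Node>(V);
  Parent->Left = std::move(L);
  Parent->Right = std::move(R);
  return Parent;
}

// Borrows the tree through a non-owning raw pointer; nothing is freed here.
int sum(const Node *N) {
  return N ? N->Value + sum(N->Left.get()) + sum(N->Right.get()) : 0;
}
```

Destroying the root releases the whole tree. Raw pointers still appear, but only as borrowed, non-owning references.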

**Example:**

"""c++

#include "llvm/Support/Allocator.h"

void processData() {
  llvm::BumpPtrAllocator Allocator;

  // Allocate storage for 10 ints from the arena. The arena owns this memory,
  // so it must not be wrapped in a smart pointer or passed to delete.
  int *Data = Allocator.Allocate<int>(10);

  for (int I = 0; I < 10; ++I) {
    Data[I] = I * 2;
  }

  // The memory is released in bulk when Allocator goes out of scope.
  // No destructors run, so only keep trivially destructible objects here.
}

"""

### 3.6. Exception Safety

* **Do This:** Keep in mind that LLVM itself is built without exceptions and RTTI by default and reports recoverable errors through "llvm::Error", "llvm::Expected", and "llvm::ErrorOr" rather than "throw". In code that does use exceptions (for example, a client application that links against LLVM), design for exception safety: release resources through RAII and smart pointers rather than relying on try-catch blocks alone.

* **Don't Do This:** Write code that leaks resources or corrupts data structures if an exception is thrown.

**Why:** Exception safety prevents unexpected behavior and data corruption.
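
The RAII point can be sketched in standard C++. This is an illustration under stated assumptions, not LLVM code: "openHandles", "HandleGuard", "riskyOperation", and "releasesOnFailure" are hypothetical names, and the counter stands in for a real resource such as a file or a lock.

```cpp
#include <stdexcept>

// Hypothetical counter standing in for a real resource (file, lock, handle).
inline int &openHandles() {
  static int Count = 0;
  return Count;
}

// RAII wrapper: acquires in the constructor, releases in the destructor.
class HandleGuard {
public:
  HandleGuard() { ++openHandles(); }
  ~HandleGuard() { --openHandles(); }
  HandleGuard(const HandleGuard &) = delete;
  HandleGuard &operator=(const HandleGuard &) = delete;
};

// May throw part-way through; the guard still releases during stack unwinding.
void riskyOperation(bool Fail) {
  HandleGuard Guard;
  if (Fail)
    throw std::runtime_error("simulated failure");
}

// Returns true if no handle is left open after a failing call.
bool releasesOnFailure() {
  try {
    riskyOperation(/*Fail=*/true);
  } catch (const std::runtime_error &) {
    // Swallowed on purpose: only the cleanup behavior matters here.
  }
  return openHandles() == 0;
}
```

Because the release lives in a destructor, the function needs no cleanup code on its error path, and the same guard works unchanged in code compiled without exceptions.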

### 3.7. Concurrency and Thread Safety

* **Do This:** When writing multi-threaded code, use LLVM's threading utilities (e.g., "llvm::thread", "llvm::sys::Mutex") or standard C++ threading primitives ("std::thread", "std::mutex", "std::atomic"). Ensure that your code is thread-safe by using proper locking and synchronization mechanisms. Consider using LLVM's parallel algorithms from "llvm/Support/Parallel.h" (e.g. "llvm::parallelForEach") where applicable.

* **Don't Do This:** Introduce data races or deadlocks in multi-threaded code, or use global mutable variables without proper synchronization.

**Why:** Concurrency bugs can be difficult to detect and debug.
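
A minimal sketch of the locking advice, using only the standard primitives named above. "SafeCounter" and "countInParallel" are hypothetical names introduced for illustration.

```cpp
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical shared counter; every access to Value goes through the mutex.
class SafeCounter {
public:
  void increment() {
    std::lock_guard<std::mutex> Lock(Mutex);
    ++Value;
  }

  int get() const {
    std::lock_guard<std::mutex> Lock(Mutex);
    return Value;
  }

private:
  mutable std::mutex Mutex;
  int Value = 0;
};

// Starts NumThreads workers that each increment the counter PerThread times,
// waits for all of them, and returns the final count.
int countInParallel(unsigned NumThreads, unsigned PerThread) {
  SafeCounter Counter;
  std::vector<std::thread> Workers;
  Workers.reserve(NumThreads);
  for (unsigned T = 0; T < NumThreads; ++T)
    Workers.emplace_back([&Counter, PerThread] {
      for (unsigned I = 0; I < PerThread; ++I)
        Counter.increment();
    });
  for (std::thread &W : Workers)
    W.join();
  return Counter.get();
}
```

Without the lock, the increments would race and the final count could fall short. For a plain counter like this one, "std::atomic" would be a lighter alternative.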

### 3.8. Performance Optimization

* **Do This:** Profile your code to identify performance bottlenecks. Use appropriate data structures and algorithms. Avoid unnecessary memory allocations and copies. Consider using LLVM's intrinsics for optimized operations. Use tools like "perf" or "VTune" to analyze performance.

* **Don't Do This:** Make premature optimizations without profiling. Ignore performance implications.

**Why:** Optimizing performance is crucial for LLVM's functionality as a compiler infrastructure.
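
As one illustration of avoiding unnecessary allocations and copies, the sketch below joins strings after reserving the full output size once. It uses standard C++17 only, and "joinReserved" is a hypothetical helper. In LLVM code, "llvm::StringRef" would play the role of "std::string_view".

```cpp
#include <cstddef>
#include <string>
#include <string_view>
#include <vector>

// Joins Parts with Sep. The inputs are non-owning views, so no element is
// copied until it is appended to the output.
std::string joinReserved(const std::vector<std::string_view> &Parts,
                         std::string_view Sep) {
  // First pass: compute the exact output size.
  std::size_t Total = 0;
  for (std::string_view P : Parts)
    Total += P.size();
  if (!Parts.empty())
    Total += Sep.size() * (Parts.size() - 1);

  std::string Out;
  Out.reserve(Total); // At most one allocation instead of repeated regrowth.

  // Second pass: append without triggering further reallocation.
  for (std::size_t I = 0; I < Parts.size(); ++I) {
    if (I != 0)
      Out.append(Sep);
    Out.append(Parts[I]);
  }
  return Out;
}
```

Whether a change like this pays off depends on the workload, so measure it with a profiler before and after.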

### 3.9. Security Best Practices

* **Do This:** Be aware of common security vulnerabilities (e.g., buffer overflows, format string bugs, integer overflows). Use safe coding practices to prevent these vulnerabilities. Validate external inputs. Consider using static analysis tools to detect security flaws.

* **Don't Do This:** Ignore security implications or introduce vulnerabilities into the codebase.

**Why:** Security vulnerabilities can compromise the integrity and reliability of LLVM-based tools.
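
Input validation and overflow checks can be sketched as follows. This is an illustrative example in standard C++17, and "parseBounded" and "checkedAdd" are hypothetical helpers rather than LLVM APIs.

```cpp
#include <charconv>
#include <cstdint>
#include <optional>
#include <string_view>
#include <system_error>

// Parses a decimal value in [0, Max]. Rejects empty input, signs, trailing
// characters, and values that do not fit in 32 bits.
std::optional<std::uint32_t> parseBounded(std::string_view Text,
                                          std::uint32_t Max) {
  std::uint32_t Value = 0;
  const char *Begin = Text.data();
  const char *End = Begin + Text.size();
  auto [Ptr, Ec] = std::from_chars(Begin, End, Value);
  if (Ec != std::errc() || Ptr != End) // Conversion failed or trailing junk.
    return std::nullopt;
  if (Value > Max)
    return std::nullopt;
  return Value;
}

// Adds two unsigned values, returning nullopt instead of wrapping around.
std::optional<std::uint32_t> checkedAdd(std::uint32_t A, std::uint32_t B) {
  if (A > UINT32_MAX - B)
    return std::nullopt;
  return A + B;
}
```

Returning "std::optional" forces callers to handle the rejected case explicitly instead of continuing with a wrapped or partially parsed value.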

### 3.10. Tooling Integration

* **Do This:** Integrate your tools with existing LLVM infrastructure, like FileCheck. Use standard LLVM libraries for common tasks. Contribute reusable components back to LLVM when appropriate.

* **Don't Do This:** Reinvent the wheel or create standalone tools that duplicate existing functionality.

**Why:** A unified ecosystem improves toolchain maintainability and reusability.

## 4. Recommended Libraries and Tools

* **clang-format:** Automatic code formatter for LLVM code style.

* **clang-tidy:** Static analysis tool for detecting code defects.

* **lit:** LLVM's integrated testing tool.

* **FileCheck:** Flexible pattern matching utility for testing.

* **CMake:** Cross-platform build system.

* **Ninja:** Fast build system.

* **valgrind:** Memory debugging and profiling tool.

* **gdb/lldb:** Debuggers for C++.

* **perf/VTune:** Performance analysis tools.

* **GitHub pull requests:** Code review workflow used by LLVM (replaced Phabricator).

* **TableGen:** Tool for generating code from declarative descriptions.

## 5. Deprecated Features and Known Issues

* Refer to the latest LLVM release notes for information on deprecated features and known issues. Avoid using deprecated APIs or features.

* Be aware of potential bugs in third-party libraries or tools. Report any issues you find to the appropriate developers.

## 6. Conclusion

Following these tooling and ecosystem standards contributes to a higher-quality, more maintainable, and more efficient LLVM project. By adhering to these guidelines, developers can keep their work consistent, collaborate more easily, and avoid common pitfalls as LLVM and its tooling evolve.
