# State Management Standards for PostgreSQL

This document outlines the coding standards and best practices for state management within PostgreSQL applications. It aims to guide developers in building maintainable, performant, and secure database-driven applications.

## 1. Introduction to State Management in PostgreSQL

State management is a crucial aspect of application development, referring to how an application maintains and utilizes data between user interactions or different points in time. In the context of PostgreSQL, state management encompasses not only the data stored in tables but also transient data, session information, application-specific states, and data flows controlled within the database itself using SQL and procedural languages. Effective state management is vital for:

* **Maintainability:** Well-defined state helps in understanding the system's behavior and simplifies debugging and modifications.

* **Performance:** Efficient state management avoids unnecessary data reads and writes, thereby improving overall system performance.

* **Consistency:** Reliable state handling prevents data corruption and ensures predictable application behavior.

* **Scalability:** The ability to manage state efficiently is essential for scaling applications to handle increased load.

## 2. Approaches to State Management in PostgreSQL

### 2.1 Data Persistence with Tables

The primary responsibility of a PostgreSQL database is the persistent storage of data within tables. Data modeling is the most important aspect of managing state in tables.

**Do This:**

* **Normalization:** Adhere to database normalization principles (1NF, 2NF, 3NF, etc.) to reduce redundancy and improve data integrity.

* **Data Types:** Use appropriate data types for each column (e.g., "INTEGER", "TEXT", "DATE", "JSONB", "UUID"). Choosing the correct type significantly impacts storage and performance.

* **Constraints:** Employ constraints ("NOT NULL", "UNIQUE", "CHECK", "FOREIGN KEY") to enforce data integrity and business rules directly within the database.

* **Indexes:** Use indexes strategically on frequently queried columns to speed up data retrieval. Consider composite indexes for complex queries involving multiple columns.

* **Partitioning:** For large tables, consider partitioning, which divides a table into smaller, more manageable pieces and lets the planner skip partitions a query cannot match. Prefer declarative partitioning (introduced in PostgreSQL 10) over trigger-based approaches: it is less complex and usually performs better.

**Don't Do This:**

* **Over-normalization:** Avoid excessive normalization that can lead to complex joins and reduced query performance.

* **Generic Data Types:** Don't use "TEXT" for everything. Use the most specific applicable type. For example, use "INTEGER" instead of "TEXT" when storing an integer value.

* **Ignoring Constraints:** Omitting constraints lets invalid data into the database, causing cascading problems down the road.

* **Unnecessary Indexes:** Adding too many indexes can slow down write operations and increase storage costs. Regularly review and remove unused indexes.

* **Ignoring Data Locality:** Don't choose a partition key without considering access patterns. Partition on the column most queries filter by, so that irrelevant partitions can be skipped.

**Example:**

"""sql

-- Creating a table with appropriate data types and constraints
-- (gen_random_uuid() is built in from PostgreSQL 13; earlier versions need the pgcrypto extension)
CREATE TABLE users (
    user_id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
    username VARCHAR(50) UNIQUE NOT NULL,
    email VARCHAR(255) UNIQUE NOT NULL,
    password_hash VARCHAR(255) NOT NULL,
    created_at TIMESTAMP WITH TIME ZONE DEFAULT now()
);

-- No separate index on email is needed: the UNIQUE constraint already creates one

-- Creating a partitioned table (example for user activity)
-- The primary key must include the partition key column (activity_time)
CREATE TABLE user_activity (
    activity_id UUID NOT NULL DEFAULT gen_random_uuid(),
    user_id UUID NOT NULL,
    activity_type VARCHAR(50) NOT NULL,
    activity_time TIMESTAMP WITH TIME ZONE NOT NULL,
    PRIMARY KEY (activity_id, activity_time)
) PARTITION BY RANGE (activity_time);

-- Creating partitions for different time ranges (upper bounds are exclusive)
CREATE TABLE user_activity_2023_01 PARTITION OF user_activity
    FOR VALUES FROM ('2023-01-01') TO ('2023-02-01');

CREATE TABLE user_activity_2023_02 PARTITION OF user_activity
    FOR VALUES FROM ('2023-02-01') TO ('2023-03-01');

-- Foreign keys on partitioned tables require PostgreSQL 11 or later
ALTER TABLE user_activity
    ADD CONSTRAINT fk_user_activity_user_id
    FOREIGN KEY (user_id) REFERENCES users(user_id);

"""

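The constraint and index guidance above can be sketched on a small table. This is a minimal illustration rather than part of the schema used elsewhere in this document: the "orders_demo" table and its columns are hypothetical, and "gen_random_uuid()" is assumed to be available (built in from PostgreSQL 13). The final query is one possible way to look for unused indexes through the "pg_stat_user_indexes" statistics view.

```sql
-- Hypothetical table used only for illustration
CREATE TABLE orders_demo (
    order_id    UUID PRIMARY KEY DEFAULT gen_random_uuid(),
    customer_id UUID NOT NULL,
    status      TEXT NOT NULL
                CHECK (status IN ('pending', 'paid', 'shipped', 'cancelled')),
    order_total NUMERIC(12, 2) NOT NULL CHECK (order_total >= 0),
    order_date  DATE NOT NULL DEFAULT current_date
);

-- Composite index for queries that filter by customer and then by date
CREATE INDEX idx_orders_demo_customer_date
    ON orders_demo (customer_id, order_date);

-- Candidate unused indexes: never scanned since statistics were last reset
SELECT schemaname,
       relname      AS table_name,
       indexrelname AS index_name,
       pg_size_pretty(pg_relation_size(indexrelid)) AS index_size
FROM pg_stat_user_indexes
WHERE idx_scan = 0
ORDER BY pg_relation_size(indexrelid) DESC;
```

An index with zero scans may still enforce a primary key or unique constraint, so check what it backs before dropping it.
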
### 2.2 Using Temporary Tables

Temporary tables are useful for storing intermediate results during complex queries and operations. They exist only for the duration of the session or transaction in which they were created.

**Do This:**

* **Transaction-specific vs. Session-specific:** By default, a table created with "CREATE TEMP TABLE" lasts until the end of the session. To scope it to a single transaction, add "ON COMMIT DROP" so that it is dropped automatically when the transaction ends.

* **Unlogged Tables:** Temporary tables are never written to the write-ahead log, so they are already cheap to write. "UNLOGGED" applies only to regular tables and cannot be combined with "TEMP"; use "CREATE UNLOGGED TABLE" when non-durable data must be visible across sessions. Unlogged tables are not crash-safe: they are truncated after a crash or unclean shutdown.

* **Optimize Usage:** Use temporary tables where they significantly simplify complex queries or improve performance by pre-computing intermediate results.

**Don't Do This:**

* **Overuse:** Avoid excessive use of temporary tables, as creating and managing them has overhead. Evaluate if CTEs (Common Table Expressions) can achieve the same result more efficiently.

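* **Unnecessary Durability:** Don't store scratch data in a regular (logged) table when it doesn't need to survive a crash; use a temporary or unlogged table instead.

* **Ignoring Indexing:** For non-trivial temporary tables, index the columns used in "WHERE" clauses and joins, and run "ANALYZE" after loading them, because autovacuum does not process temporary tables.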

**Example:**

"""sql

-- Using a temporary table to pre-compute intermediate results
CREATE TEMP TABLE monthly_sales AS
SELECT
    EXTRACT(MONTH FROM order_date) AS month,
    SUM(order_total) AS total_sales
FROM
    orders
WHERE
    -- A range predicate, unlike EXTRACT(YEAR ...), can use an index on order_date
    order_date >= DATE '2023-01-01'
    AND order_date < DATE '2024-01-01'
GROUP BY
    month;

-- Querying the temporary table
SELECT
    month,
    total_sales
FROM
    monthly_sales
ORDER BY
    month;

-- Using an UNLOGGED table
-- UNLOGGED cannot be combined with TEMP; temporary tables already skip write-ahead logging
CREATE UNLOGGED TABLE unlogged_example (id INT, val TEXT);

INSERT INTO unlogged_example (id, val) VALUES (1, 'test');

SELECT * FROM unlogged_example;

"""

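The transaction-scoped option described above can be sketched as follows. This is a minimal example under stated assumptions: the "recent_orders" name is hypothetical, and "orders" is assumed to have "order_id", "user_id", "order_total", and "order_date" columns.

```sql
BEGIN;

-- ON COMMIT DROP ties the table's lifetime to this transaction
CREATE TEMP TABLE recent_orders ON COMMIT DROP AS
SELECT order_id, user_id, order_total
FROM orders
WHERE order_date >= DATE '2023-01-01';

-- Index the column used for grouping and lookups, then refresh planner statistics
-- (autovacuum does not analyze temporary tables)
CREATE INDEX ON recent_orders (user_id);
ANALYZE recent_orders;

SELECT user_id, SUM(order_total) AS total_spent
FROM recent_orders
GROUP BY user_id;

COMMIT;  -- recent_orders is dropped here
```

Scoping the table to the transaction avoids leftover objects when sessions are reused, for example behind a connection pool.
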
### 2.3 Using Common Table Expressions (CTEs)

CTEs (Common Table Expressions) allow you to define temporary result sets within a query, improving readability and maintainability.

**Do This:**

* **Recursive CTEs:** Utilize recursive CTEs for hierarchical data structures or iterative computations.

* **Readability:** Use CTEs to break down complex queries into smaller, more understandable parts.

**Don't Do This:**

* **Over-nesting:** Avoid deeply nested CTEs that can become difficult to manage.

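* **Performance Misconceptions:** Understand that CTEs are primarily for readability and modularity, not performance. Before PostgreSQL 12, every CTE was materialized and acted as an optimization fence; since PostgreSQL 12, a non-recursive, side-effect-free CTE that is referenced only once is inlined into the outer query by default. Use "MATERIALIZED" or "NOT MATERIALIZED" to override the planner's choice.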

**Example:**

"""sql

-- Non-Recursive CTE: averaging each customer's mean order value
-- (this is the average across customers, not the average over all orders)
WITH order_summary AS (
    SELECT
        customer_id,
        COUNT(*) AS order_count,
        SUM(order_total) AS total_spent
    FROM
        orders
    GROUP BY
        customer_id
)
SELECT
    AVG(total_spent / order_count) AS average_order_value
FROM
    order_summary;

-- Recursive CTE: Generating a sequence of numbers from 1 to 10
WITH RECURSIVE number_series AS (
    SELECT 1 AS n
    UNION ALL
    SELECT n + 1 FROM number_series WHERE n < 10
)
SELECT n FROM number_series;

"""

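The hierarchical use case mentioned above, and the materialization control available from PostgreSQL 12, might look like the following sketch. The "employees" rows are hypothetical and supplied inline so the first query runs without any tables; the second query assumes the "orders" table used in the example above.

```sql
-- Walking a hypothetical reporting hierarchy from the top down
WITH RECURSIVE employees (id, manager_id, name) AS (
    VALUES (1, NULL::int, 'Ada'),
           (2, 1, 'Grace'),
           (3, 2, 'Linus')
),
reporting_chain AS (
    -- Anchor: employees with no manager
    SELECT id, name, 1 AS depth
    FROM employees
    WHERE manager_id IS NULL
    UNION ALL
    -- Recursive step: direct reports of the rows found so far
    SELECT e.id, e.name, rc.depth + 1
    FROM employees e
    JOIN reporting_chain rc ON e.manager_id = rc.id
)
SELECT id, name, depth
FROM reporting_chain
ORDER BY depth;

-- PostgreSQL 12+: force a CTE to be computed once instead of being inlined
WITH customer_totals AS MATERIALIZED (
    SELECT customer_id, SUM(order_total) AS total_spent
    FROM orders
    GROUP BY customer_id
)
SELECT customer_id, total_spent
FROM customer_totals
WHERE total_spent > 1000;
```

Whether "MATERIALIZED" helps depends on the query; compare plans with "EXPLAIN" before committing to either form.
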
### 2.4 Sessions and Application Context

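Application context (flags, user IDs, and other request-specific data) can be attached to a session or transaction with the "pg_catalog.set_config" function and read back with "current_setting". Custom setting names must contain a prefix and a dot, such as "myapp.user_id".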

**Do This:**

* **Authentication Propagation:** After user authentication, set the "user_id" or role to be used in subsequent queries.

* **Centralize Access:** Implement functions to set and retrieve application context, creating a uniform approach throughout the application.

**Don't Do This:**

* **Direct Access:** Avoid directly setting context variables in ad-hoc queries, which defeats the purpose of having a consistent approach.

* **Misusing for Configuration Parameters:** While "set_config" *can* be used for configuration, it's best left to *application context*. Use "ALTER SYSTEM" or parameters in "postgresql.conf" for global configurations.

**Example:**

"""sql

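-- Setting a context value after user authentication
-- The third argument is is_local: false keeps the value for the whole session,
-- true limits it to the current transaction
SELECT pg_catalog.set_config('myapp.user_id', 'a1b2c3d4-e5f6-7890-1234-567890abcdef', false);

-- Function to retrieve the user ID from the application context
-- Returns NULL when the setting is missing or empty
CREATE OR REPLACE FUNCTION get_current_user_id()
RETURNS UUID AS $$
BEGIN
    RETURN NULLIF(pg_catalog.current_setting('myapp.user_id', true), '')::uuid;
END;
$$ LANGUAGE plpgsql STABLE;

-- Using the function in a query (assumes orders.user_id is a UUID column)
SELECT * FROM orders WHERE user_id = get_current_user_id();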

"""

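When connections are shared, for instance through a pooler, a session-wide value can leak into the next request. One hedged alternative is to scope the value to a transaction by passing "true" as the third argument of "set_config". The sketch below reuses the "myapp.user_id" setting name from the example above; the UUID is a placeholder.

```sql
BEGIN;

-- is_local = true: the value is visible only until this transaction ends
SELECT pg_catalog.set_config('myapp.user_id', 'a1b2c3d4-e5f6-7890-1234-567890abcdef', true);

-- Queries in the same transaction see the value
SELECT pg_catalog.current_setting('myapp.user_id', true) AS user_id_in_tx;

COMMIT;

-- The transaction-local value is gone; the setting reverts to its previous
-- session value, which may be NULL or an empty string if none was set
SELECT pg_catalog.current_setting('myapp.user_id', true) AS user_id_after_tx;
```

Code that reads the setting should therefore treat both NULL and an empty string as "no user".
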
### 2.5 Using JSONB for Flexible State

PostgreSQL's "JSONB" data type allows you to store semi-structured data within a column.

**Do This:**

* **Configuration Options:** Store user preferences, application settings, or dynamic configurations as JSONB objects.

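* **Use Indexes:** Create GIN indexes on JSONB columns that are queried by content. The default "jsonb_ops" operator class supports "@>", "?", "?|", and "?&"; the more compact "jsonb_path_ops" class supports "@>" but not the key-existence operators.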

**Don't Do This:**

* **Abuse JSONB:** Don't use JSONB as a replacement for proper relational data modeling. Use it only for genuinely semi-structured data with variable schemas.

**Example:**

"""sql

-- Storing user preferences as JSONB
CREATE TABLE user_preferences (
    user_id UUID PRIMARY KEY,
    preferences JSONB
);

-- Inserting user preferences
INSERT INTO user_preferences (user_id, preferences) VALUES (
    'a1b2c3d4-e5f6-7890-1234-567890abcdef',
    '{"theme": "dark", "notifications": {"email": true, "sms": false}}'
);

-- Querying for users with email notifications enabled
-- The containment operator (@>) can use the GIN index below;
-- a comparison on values extracted with -> and ->> cannot
SELECT user_id
FROM user_preferences
WHERE preferences @> '{"notifications": {"email": true}}';

-- Indexing the jsonb column (jsonb_path_ops supports @> but not ?, ?| or ?&)
CREATE INDEX idx_user_preferences_notifications ON user_preferences USING GIN (preferences jsonb_path_ops);

"""

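The key-existence operators listed above need the default GIN operator class. The following sketch assumes the "user_preferences" table from the example; the index name is hypothetical, and the "UPDATE" shows one way to change a single nested value with "jsonb_set".

```sql
-- Default operator class (jsonb_ops): supports @>, ?, ?| and ?&
CREATE INDEX idx_user_preferences_all
    ON user_preferences USING GIN (preferences);

-- Key existence: users that have a top-level "theme" key
SELECT user_id
FROM user_preferences
WHERE preferences ? 'theme';

-- Containment: users whose theme is "dark"
SELECT user_id
FROM user_preferences
WHERE preferences @> '{"theme": "dark"}';

-- Changing one nested value without rebuilding the whole document by hand
UPDATE user_preferences
SET preferences = jsonb_set(preferences, '{notifications,sms}', 'true'::jsonb)
WHERE user_id = 'a1b2c3d4-e5f6-7890-1234-567890abcdef';
```

Keeping both index types on the same column is rarely worthwhile; pick the operator class that matches the operators the application actually uses.
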
### 2.6 Row-Level Security (RLS)

Row-Level Security (RLS) allows you to define policies that restrict access to rows in a table based on user attributes or application context.

**Do This:**

* **Data Isolation:** Use RLS to enforce access control policies, such as ensuring users can only see their own data.

* **Audit Trails:** RLS policies filter rows; they do not record access attempts. When access must be logged, pair RLS with a separate auditing mechanism, such as audit triggers or the pgAudit extension.

**Don't Do This:**

* **Complex Policies:** Avoid overly complex RLS policies, which are applied to every query on the table and can hurt performance. Keep policy expressions simple and index the columns they reference.

* **Ignoring Performance:** Don't adopt RLS in production without first performance testing it under production-scale load.

**Example:**

"""sql

-- Helper that returns the current user's ID from the 'myapp.user_id' setting
-- (it must exist before a policy can reference it)
CREATE OR REPLACE FUNCTION get_current_user_id()
RETURNS UUID AS $$
BEGIN
    RETURN NULLIF(pg_catalog.current_setting('myapp.user_id', true), '')::uuid;
END;
$$ LANGUAGE plpgsql STABLE;

-- Enable RLS on the orders table
-- (superusers and roles with BYPASSRLS always bypass RLS; the table owner
-- does too unless FORCE ROW LEVEL SECURITY is set)
ALTER TABLE orders ENABLE ROW LEVEL SECURITY;

-- Create a policy that allows users to only see their own orders
CREATE POLICY user_orders_policy ON orders
    FOR SELECT
    USING (user_id = get_current_user_id());

"""

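A "FOR SELECT" policy restricts reads only. As a hedged sketch of the data-isolation goal above for writes, the policies below assume the same "orders" table and a "get_current_user_id()" function that returns the caller's UUID; the policy names are hypothetical.

```sql
-- Apply policies to the table owner as well (superusers still bypass RLS)
ALTER TABLE orders FORCE ROW LEVEL SECURITY;

-- Users may insert only rows that belong to them
CREATE POLICY user_orders_insert_policy ON orders
    FOR INSERT
    WITH CHECK (user_id = get_current_user_id());

-- Users may update only their own rows and may not reassign them to someone else
CREATE POLICY user_orders_update_policy ON orders
    FOR UPDATE
    USING (user_id = get_current_user_id())
    WITH CHECK (user_id = get_current_user_id());
```

"USING" decides which existing rows a statement can see, while "WITH CHECK" validates the rows it would write. Once RLS is enabled, commands without a matching policy are denied for roles that are subject to it.
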
### 2.7 Publish/Subscribe with "NOTIFY/LISTEN"

PostgreSQL's "NOTIFY/LISTEN" mechanism provides a simple publish/subscribe functionality useful for real-time updates and event-driven architectures.

**Do This:**

* **Real-time Updates:** Use "NOTIFY" to signal clients when data has changed, allowing them to refresh their views.

* **Background Processing:** Use "LISTEN" to trigger background tasks or worker processes when specific events occur.

**Don't Do This:**

* **Reliable Message Queuing:** Don't rely on "NOTIFY/LISTEN" for guaranteed message delivery, as messages can be lost if the client is disconnected. Use a dedicated message queue system (e.g., RabbitMQ, Kafka) for reliable messaging.

* **Overusing Triggers:** Avoid overuse of triggers invoking "NOTIFY", which can lead to performance bottlenecks.

**Example:**

"""sql

-- Trigger function to notify clients when a new order is created
-- The payload must be shorter than 8000 bytes in the default configuration,
-- so send only the key and let listeners fetch the row
-- (assumes orders has an order_id column)
CREATE OR REPLACE FUNCTION notify_new_order()
RETURNS TRIGGER AS $$
BEGIN
    PERFORM pg_notify('new_order_channel', json_build_object('order_id', NEW.order_id)::text);
    RETURN NEW;
END;
$$ LANGUAGE plpgsql;

-- Creating a trigger on the orders table (EXECUTE FUNCTION requires PostgreSQL 11 or later)
CREATE TRIGGER new_order_trigger
AFTER INSERT ON orders
FOR EACH ROW
EXECUTE FUNCTION notify_new_order();

-- Example usage from the client side (psql)
LISTEN new_order_channel;

-- Then insert a new record to "orders"
-- INSERT INTO orders (order_id, user_id, order_total) VALUES (gen_random_uuid(), '849aa4ef-9c92-4599-b50a-47d23a24c85b', 100.00);

"""

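Notifications are transactional, which matters when using them to signal data changes. The sketch below is meant to be run by hand in two psql sessions; the channel name comes from the example above and the payloads are placeholders.

```sql
-- Session A: subscribe to the channel
LISTEN new_order_channel;

-- Session B: a notification sent inside a rolled-back transaction is never delivered
BEGIN;
NOTIFY new_order_channel, 'this message is discarded';
ROLLBACK;

-- Session B: a notification is delivered only when its transaction commits
BEGIN;
SELECT pg_notify('new_order_channel', '{"order_id": "849aa4ef-9c92-4599-b50a-47d23a24c85b"}');
COMMIT;

-- Session A: stop listening when updates are no longer needed
UNLISTEN new_order_channel;
```

Because delivery happens at commit, listeners never observe a notification for a row that was not actually written.
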
### 2.8 Advisory Locks

Advisory locks allow you to implement application-level locking mechanisms to coordinate access to shared resources.

**Do This:**

* **Task Synchronization:** Use advisory locks to prevent concurrent execution of critical tasks or operations.

* **Resource Protection:** Use advisory locks to protect shared resources, such as files or external systems.

* **"pg_advisory_lock" vs. "pg_advisory_lock_shared":** Choose the appropriate lock type depending on whether you need exclusive or shared access to the resource.

**Don't Do This:**

* **Deadlocks:** Be careful to avoid deadlocks when using advisory locks. Acquire locks in a consistent order. Always release locks in a timely manner.

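* **Unreleased Locks:** Session-level advisory locks are held until they are explicitly released or the session ends, so a lock can leak when a pooled connection is reused without unlocking. Prefer transaction-level locks ("pg_advisory_xact_lock"), which are released automatically at the end of the transaction.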

**Example:**

"""sql

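-- Acquiring a session-level advisory lock (blocks until the lock is available)
SELECT pg_advisory_lock(123);

-- Attempting to acquire the lock without blocking, from a second session
-- Returns false while another session holds the lock. In the session that
-- already holds it, this returns true and the lock must be released once per acquisition
SELECT pg_try_advisory_lock(123);

-- Releasing the advisory lock in the session that acquired it
SELECT pg_advisory_unlock(123);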

"""

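Transaction-level and shared variants follow the same pattern. In this sketch the key 1001 is an arbitrary, hypothetical identifier for the protected task, and the final query shows one way to inspect which advisory locks are currently held.

```sql
BEGIN;

-- Non-blocking, transaction-level lock: returns true if acquired
SELECT pg_try_advisory_xact_lock(1001) AS got_lock;

-- ... perform the protected work here only if got_lock is true ...

COMMIT;  -- a transaction-level lock is released automatically here

-- Shared session-level lock: many sessions can hold it at once,
-- but it conflicts with an exclusive lock on the same key
SELECT pg_advisory_lock_shared(1001);
SELECT pg_advisory_unlock_shared(1001);

-- Inspecting advisory locks currently held or awaited
SELECT locktype, classid, objid, mode, granted, pid
FROM pg_locks
WHERE locktype = 'advisory';
```

There is no explicit unlock function for transaction-level locks, which is what makes them hard to leak.
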
## 3. Managing Data Flow and Reactivity

Managing the flow of data through the database involves strategies to ensure that data changes cascade appropriately and that applications react quickly to state transitions.

### 3.1 Triggers

Triggers can execute custom functions in response to INSERT, UPDATE, or DELETE operations.

**Do This:**

* **Auditing:** Create audit logs automatically whenever data is modified.

* **Data Validation:** Enforce complex validation rules that cannot be expressed with "CHECK" constraints.

**Don't Do This:**

* **Complex Business Logic:** Avoid placing complex business logic in triggers, as it can make the system harder to understand and debug. Prefer application-layer logic.

* **Cascading Operations:** Limit the scope of triggered actions to avoid complex cascading operations that can lead to performance issues.

**Example:**

"""sql

-- Creating a trigger function to audit changes to the products table
CREATE OR REPLACE FUNCTION audit_products()
RETURNS TRIGGER AS $$
BEGIN
    INSERT INTO product_audit (product_id, old_name, new_name, updated_at)
    VALUES (OLD.product_id, OLD.name, NEW.name, now());
    RETURN NEW;
END;
$$ LANGUAGE plpgsql;

-- Creating a trigger on the products table
CREATE TRIGGER products_audit_trigger
AFTER UPDATE OF name ON products
FOR EACH ROW
EXECUTE FUNCTION audit_products();

"""

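The data-validation use case above covers rules a "CHECK" constraint cannot express, such as rules that read other rows. The sketch below is illustrative only: the function and trigger names and the 10000 limit are hypothetical, and "orders" is assumed to have "user_id" and "order_total" columns.

```sql
-- Reject an order if it would push the user's total past a limit
CREATE OR REPLACE FUNCTION check_order_total_limit()
RETURNS TRIGGER AS $$
DECLARE
    existing_total NUMERIC;
BEGIN
    SELECT COALESCE(SUM(order_total), 0)
    INTO existing_total
    FROM orders
    WHERE user_id = NEW.user_id;

    IF existing_total + NEW.order_total > 10000 THEN
        RAISE EXCEPTION 'User % would exceed the order limit', NEW.user_id
            USING ERRCODE = 'check_violation';
    END IF;

    RETURN NEW;  -- returning NEW lets the insert proceed
END;
$$ LANGUAGE plpgsql;

-- BEFORE trigger so the row is rejected before it is written
CREATE TRIGGER orders_limit_trigger
BEFORE INSERT ON orders
FOR EACH ROW
EXECUTE FUNCTION check_order_total_limit();
```

A check like this is not safe under concurrency by itself: two simultaneous inserts can each pass validation. Combine it with locking or a stricter isolation level where the rule must hold exactly.
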
### 3.2 Materialized Views

Materialized views store the results of a query as a table, which can be refreshed periodically or on demand.

**Do This:**

* **Pre-computed Aggregations:** Use materialized views to store pre-computed aggregations for frequently accessed data.

* **Complex Joins:** Materialize the results of complex joins for faster retrieval.

**Don't Do This:**

* **Real-time Data:** Don't use materialized views for data that requires real-time updates, as they need to be refreshed explicitly.

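* **Ignoring Refresh Costs:** Ensure refreshing the materialized view doesn't become a performance bottleneck. Consider "REFRESH MATERIALIZED VIEW CONCURRENTLY", which lets reads continue during the refresh but requires a unique index on the materialized view.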

**Example:**

"""sql

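-- Creating a materialized view of daily sales totals
CREATE MATERIALIZED VIEW daily_sales_summary AS
SELECT
    order_date,
    SUM(order_total) AS total_sales
FROM
    orders
GROUP BY
    order_date;

-- CONCURRENTLY requires a unique index on the materialized view
CREATE UNIQUE INDEX idx_daily_sales_summary_order_date ON daily_sales_summary (order_date);

-- Refreshing the materialized view without blocking readers
REFRESH MATERIALIZED VIEW CONCURRENTLY daily_sales_summary;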

"""

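Materializing a join, as suggested above, follows the same steps. This sketch assumes the "users" and "orders" tables from earlier examples, with "orders.user_id" referencing "users.user_id"; the view and index names are hypothetical.

```sql
-- Pre-computing a join plus aggregation per user
CREATE MATERIALIZED VIEW user_order_stats AS
SELECT
    u.user_id,
    u.username,
    COUNT(o.order_id) AS order_count,
    COALESCE(SUM(o.order_total), 0) AS total_spent
FROM users u
LEFT JOIN orders o ON o.user_id = u.user_id
GROUP BY u.user_id, u.username;

-- Unique index: speeds up lookups and allows concurrent refreshes
CREATE UNIQUE INDEX idx_user_order_stats_user_id
    ON user_order_stats (user_id);

-- Refresh on whatever schedule the application can tolerate
REFRESH MATERIALIZED VIEW CONCURRENTLY user_order_stats;

-- Reads hit the stored result instead of re-running the join
SELECT username, order_count, total_spent
FROM user_order_stats
ORDER BY total_spent DESC
LIMIT 10;
```

Results are only as fresh as the last refresh, so the refresh interval is effectively the staleness budget for this data.
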
## 4. Conclusion

These standards provide a foundation for building robust and maintainable applications using PostgreSQL. Following them helps keep state handling consistent, performant, and scalable. Treat this as a living document and update it as new features and best practices emerge within the PostgreSQL ecosystem.
