KlongPy: PyTorch Back End and Autograd

PyTorch Backend & Autograd - KlongPy

Initializing search

briangu/klongpy

Backend Comparison

Performance

Automatic Differentiation

GPU Acceleration

Mixing with Python

Best Practices

Function Compilation

Gradient Verification

Troubleshooting

Performance

Reference

Data & Systems

Backend Comparison

Performance

Automatic Differentiation

GPU Acceleration

Mixing with Python

Best Practices

Function Compilation

Gradient Verification

Troubleshooting

PyTorch Backend and Autograd¶

KlongPy supports multiple array backends. The PyTorch backend enables GPU acceleration and automatic differentiation (autograd) for gradient-based computations.

Enabling the PyTorch Backend¶

Command Line¶

# Use --backend flag kgpy --backend torch

# With GPU device selection kgpy --backend torch --device cuda

Programmatically¶

from klongpy import KlongInterpreter

# Create interpreter with torch backend klong = KlongInterpreter(backend="torch") print(klong._backend.name) # 'torch'

# With specific device klong = KlongInterpreter(backend="torch", device="cuda")

Backend Comparison¶

Feature NumPy Backend PyTorch Backend

Default Yes No (use --backend torch)

Object dtype Yes No

String operations Yes Not supported

GPU acceleration No Yes (CUDA/MPS)

Autograd Numeric only Native autograd

Small array performance Faster Slightly slower

Large array performance Good Better (especially on GPU)

Performance¶

The torch backend excels with large arrays:

Benchmark NumPy Torch Winner vector_add_100K 0.04ms 0.08ms NumPy (2x) vector_add_1M 0.36ms 0.07ms Torch (5x) compound_expr_1M 0.61ms 0.07ms Torch (8x) grade_up_100K 0.59ms 0.19ms Torch (3x)

For small arrays (¶

KlongPy provides several gradient and differentiation operators:

Typing Special Characters¶

Symbol Name Mac Windows

Nabla Character Viewer (Ctrl+Cmd+Space) Alt+8711

Partial Option + d Alt+8706

On Mac, ∂ can be typed directly with Option + d . For ∇, use the Character Viewer or copy-paste.

:> Autograd Operator (Recommended)¶

The :> operator uses PyTorch autograd for exact gradients:

f::{x^2} :" Define f(x) = x^2 f:>3 :" Compute f'(3) = 6.0

The syntax is function:>point where: - function is a scalar-valued function (must return a single number) - point is the input at which to compute the gradient

∇ Numeric Gradient Operator¶

The ∇ operator always uses numeric differentiation (finite differences), regardless of backend:

f::{x^2} :" Define f(x) = x^2 3∇f :" Compute f'(3) ≈ 6.0

The syntax is point∇function (note: reversed order from :>).

How They Work¶

Operator Method Precision Speed

:> with torch PyTorch autograd Exact Fast

:> without torch Numeric ~1e-6 error Slower

∇ (any backend) Always numeric ~1e-6 error Slower

With the torch backend (--backend torch or backend='torch'), prefer :> for: - Exact gradients (no floating-point approximation error) - Complex computational graphs - Better performance on large arrays

Examples¶

Scalar function: f::{x^3} :" f(x) = x^3 f:>2 :" f'(2) = 3*4 = 12.0

Polynomial: p::{((3*x^4)-(2*x^2))+x} :" p(x) = 3x^4 - 2x^2 + x p:>1 :" p'(1) = 12 - 4 + 1 = 9.0

Vector function (sum of squares): g::{+/x^2} :" g(x) = sum(x_i^2) g:>[1.0 2.0 3.0] :" [2 4 6] = 2*x

Gradient descent: f::{x^2} x::5.0 lr::0.1

:" Update rule: x = x - lr * grad x::x-(lr*f:>x)

Multi-Parameter Gradients¶

Compute gradients for multiple parameters simultaneously using a list of symbols:

w::2.0 b::3.0 loss::{(w^2)+(b^2)}

:" Compute gradients for both w and b grads::loss:>[w b] :" [4.0 6.0] = [2w, 2b]

This is especially useful for neural network training:

w::1.0 b::0.0 X::[1 2 3] Y::[3 5 7]

:" MSE loss loss::{(+/((w*X)+b-Y)^2)%3}

:" Compute both gradients in one call grads::loss:>[w b]

Jacobian Computation¶

Compute the Jacobian matrix (matrix of partial derivatives) using the ∂ operator or .jacobian() function:

f::{x^2} :" Element-wise square

:" Using ∂ operator (point∂function) [1 2]∂f :" [[2 0] [0 4]] diagonal matrix

:" Using .jacobian() function .jacobian(f;[1 2]) :" Same result

For vector-valued functions f: R^n -> R^m, the Jacobian is an m x n matrix where J[i,j] = df_i/dx_j.

Multi-Parameter Jacobians¶

Just like gradients, you can compute Jacobians with respect to multiple parameters using a list of symbols:

w::[1.0 2.0] b::[3.0 4.0] f::{w^2} :" Returns [w0^2, w1^2]

:" Compute Jacobians for both w and b jacobians::[w b]∂f :" Returns [J_w, J_b]

This returns a list of Jacobian matrices, one per parameter. Useful for analyzing how vector-valued functions depend on multiple parameter sets.

Custom Optimizers¶

KlongPy provides the gradient primitives (:>, ∂, .jacobian()). For optimizers, use the example classes in examples/autograd/optimizers.py which you can copy to your project and customize.

Manual gradient descent (no optimizer needed): w::10.0 loss::{w^2} lr::0.1

:" Update rule: w = w - lr * gradient {w::w-(lr*loss:>w)}'!50 w...

KlongPy: PyTorch Back End and Autograd

Related Articles

Amazon, Facebook, FBI have access to a private intelligence-sharing network

SpaceX not the behemoth everyone thought

The Mirror Is Part of the Machine

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits