API introduction

This section explains the PortfolioOptimisers.jl API in detail. The pages are organised in exactly the same way as the src folder itself, so there should be a one-to-one correspondence between documentation and source files^[1].

Design philosophy

There are three overarching design choices in PortfolioOptimisers.jl:

1. Well-defined type hierarchies

Easily and quickly add new features by sticking to defined interfaces.

2. Strongly typed immutable structs

All types are concrete and known at instantiation.
Constants can be propagated if necessary.
There is always a single immutable source of truth for every process.
If needed, values must be modified via interface functions, which simplifies finding and fixing bugs. If no interface for a modification is provided, the code throws a MethodError.

3. Compositional design

PortfolioOptimisers.jl is a toolkit whose components can interact in complex, deeply nested ways.
Separation of concerns lets us subdivide logical components into isolated, self-contained units, which leads to easier and fearless development and testing.
Extensive and judicious data validation checks are performed at the earliest possible moment (mostly at variable instantiation) to ensure correctness.
Turtles all the way down. Structures can be used, reused, and nested in many ways. This allows for efficient data reuse and arbitrary complexity.
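
The three design choices above can be illustrated with a minimal sketch. All names here (AbstractMeanEstimator, TrimmedMean, ExcessMean, estimate) are hypothetical and are not part of PortfolioOptimisers.jl; the sketch only assumes Base Julia and the Statistics standard library.

```julia
using Statistics

# Hypothetical names for illustration only; none of them come from PortfolioOptimisers.jl.
abstract type AbstractMeanEstimator end

# Interface function: every subtype is expected to add a method.
function estimate end

# Immutable, concretely typed struct; the field type is fixed at instantiation.
struct TrimmedMean{T <: Real} <: AbstractMeanEstimator
    alpha::T
    # The inner constructor validates data at the earliest possible moment.
    function TrimmedMean(alpha::T) where {T <: Real}
        0 <= alpha < 0.5 || throw(DomainError(alpha, "alpha must be in [0, 0.5)"))
        return new{T}(alpha)
    end
end

# Drop the k smallest and k largest observations, then average the rest.
function estimate(est::TrimmedMean, x::AbstractVector{<:Real})
    n = length(x)
    k = floor(Int, est.alpha * n)
    s = sort(x)
    return mean(@view s[(k + 1):(n - k)])
end

# Composition: an estimator that nests any other estimator.
struct ExcessMean{E <: AbstractMeanEstimator, T <: Real} <: AbstractMeanEstimator
    inner::E
    rf::T
end
estimate(est::ExcessMean, x::AbstractVector{<:Real}) = estimate(est.inner, x) - est.rf

x = [1.0, 2.0, 3.0, 4.0, 100.0]
est = ExcessMean(TrimmedMean(0.2), 1.0)
result = estimate(est, x)
```

A new estimator only needs to subtype the abstract type and add an estimate method. Calling estimate on a subtype without such a method raises a MethodError, and invalid parameters are rejected when the struct is constructed.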

Design goals

This philosophy has three primary goals:

1. Maintainability and expandability

The only way to break existing functionality should be by modifying APIs.
Adding functionality should be a case of subtyping existing abstract types and implementing the correct interfaces.
Avoid leaking side effects to other components unless strictly necessary. An example of a necessary leak is entropy pooling, which requires a vector of observation weights that must be taken into account in different, largely unrelated places.

2. Correctness and robustness

Each subunit should perform its own data validation as early as possible unless it absolutely needs downstream data.

3. Performance

Types and constants are always fully known at inference time.
Immutability lets smaller structs live on the stack.

Features

This section is under active development, so any entry shown as [<name>]-(@ref) lacks a docstring.

Preprocessing

Prices to returns prices_to_returns and ReturnsResult
Asset selection ScoreSelector, ZeroVarianceFilter, CompleteAssetSelector

Matrix processing

Positive definite projection Posdef, posdef!, posdef
Denoising Denoise, denoise!, denoise
- Spectral SpectralDenoise
- Fixed FixedDenoise
- Shrunk ShrunkDenoise
Detoning Detone, detone!, detone
Matrix processing pipeline MatrixProcessing, matrix_processing!, matrix_processing_step!, matrix_processing

Regression models

Factor prior models and implied volatility use regression in their estimation, which returns a Regression object.

Regression targets

Linear model LinearModel
Generalised linear model GeneralisedLinearModel

Regression types

Stepwise StepwiseRegression
- Algorithms
  - ForwardSelection ForwardSelection
  - BackwardElimination BackwardElimination
- Selection criteria
  - P-value PValue
  - Akaike information criterion AIC
  - Corrected Akaike information criterion AICC
  - Bayesian information criterion BIC
  - R-squared RSquared
  - Adjusted R-squared AdjustedRSquared
Dimensional reduction with custom mean and variance estimators DimensionReductionRegression
- Principal component PCA
- Probabilistic principal component PPCA

Moment estimation

Expected returns

Overloads Statistics.mean.

Optionally weighted expected returns SimpleExpectedReturns
Equilibrium expected returns with custom covariance EquilibriumExpectedReturns
Excess expected returns with custom expected returns estimator ExcessExpectedReturns
Shrunk expected returns with custom expected returns and custom covariance estimators ShrunkExpectedReturns
- Algorithms
  - James-Stein JamesStein
  - Bayes-Stein BayesStein
  - Bodnar-Okhrin-Parolya BodnarOkhrinParolya
- Targets: all algorithms can have any of the following targets
  - Grand Mean GrandMean
  - Volatility Weighted VolatilityWeighted
  - Mean Squared Error MeanSquaredError
Standard deviation expected returns StandardDeviationExpectedReturns
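
As a rough illustration of what overloading Statistics.mean for an estimator type can look like, here is a minimal sketch. WeightedMeanSketch is a hypothetical type, not the package's SimpleExpectedReturns, and the convention that observations are in rows and assets in columns is an assumption made for this example.

```julia
using Statistics

# Hypothetical estimator for illustration; not a PortfolioOptimisers.jl type.
struct WeightedMeanSketch{W <: Union{Nothing, AbstractVector{<:Real}}}
    w::W  # optional observation weights
end

# Unweighted case: plain column means (observations in rows, assets in columns).
function Statistics.mean(est::WeightedMeanSketch{Nothing}, X::AbstractMatrix{<:Real})
    return vec(mean(X; dims = 1))
end

# Weighted case: normalise the observation weights, then average each column.
function Statistics.mean(est::WeightedMeanSketch{<:AbstractVector}, X::AbstractMatrix{<:Real})
    w = est.w ./ sum(est.w)
    return X' * w
end

X = [0.01 0.02; 0.03 0.00; -0.01 0.04]
mu_plain = mean(WeightedMeanSketch(nothing), X)
mu_weighted = mean(WeightedMeanSketch([1.0, 1.0, 2.0]), X)
```

Because the estimator is the first argument, the same mean call works for any estimator type, and dispatch picks the right implementation.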

Variance and standard deviation

Overloads Statistics.var and Statistics.std.

Optionally weighted variance with custom expected returns estimator SimpleVariance

Covariance and correlation

Overloads Statistics.cov and Statistics.cor.

Optionally weighted covariance with custom covariance estimator GeneralCovariance
Covariance with custom covariance estimator Covariance
- FullMoment FullMoment
- SemiMoment SemiMoment
Gerber covariances with custom variance and demeaning estimator GerberCovariance
- Gerber 0 Gerber0
- Gerber 1 Gerber1
- Gerber 2 Gerber2
Smyth-Broby extension of Gerber covariances with custom expected returns and variance estimators SmythBrobyCovariance
- Smyth-Broby 0 SmythBroby0
- Smyth-Broby 1 SmythBroby1
- Smyth-Broby 2 SmythBroby2
- Smyth-Broby-Gerber 0 SmythBrobyGerber0
- Smyth-Broby-Gerber 1 SmythBrobyGerber1
- Smyth-Broby-Gerber 2 SmythBrobyGerber2
- Smyth-Broby-Count 0 SmythBrobyCount0
- Smyth-Broby-Count 1 SmythBrobyCount1
- Smyth-Broby-Count 2 SmythBrobyCount2
Gerber Information Quality GerberIQCovariance with custom variance, demeaning, temporal decay and numerator + denominator estimators
- Basic template BasicGerberIQ
- Partial template PartialGerberIQ
- Full template FullGerberIQ
Distance covariance with custom distance estimator via Distances.jl DistanceCovariance
Lower Tail Dependence covariance LowerTailDependenceCovariance
Rank covariances
- Kendall covariance KendallCovariance
- Spearman covariance SpearmanCovariance
Mutual information covariance with custom variance estimator and various binning algorithms MutualInfoCovariance
- Bin-width-rule bins BinWidthBins
  - Knuth's optimal bin width Knuth
  - Freedman-Diaconis bin width FreedmanDiaconis
  - Scott's bin width Scott
- Hacine-Gharbi-Ravier bin width HacineGharbiRavier
- Predefined number of bins
Denoised covariance with custom covariance estimator DenoiseCovariance
Detoned covariance with custom covariance estimator DetoneCovariance
Custom processed covariance with custom covariance estimator ProcessedCovariance
Implied volatility with custom covariance and matrix processing estimators, and implied volatility algorithms ImpliedVolatility
- Premium ImpliedVolatilityPremium
- Regression ImpliedVolatilityRegression
Covariance with custom covariance estimator and matrix processing pipeline PortfolioOptimisersCovariance
Correlation covariance CorrelationCovariance

Coskewness

Implements coskewness.

Coskewness and spectral decomposition of the negative coskewness with custom expected returns estimator and matrix processing pipeline Coskewness
- FullMoment FullMoment
- SemiMoment SemiMoment

Cokurtosis

Implements cokurtosis.

Cokurtosis with custom expected returns estimator and matrix processing pipeline Cokurtosis
- FullMoment FullMoment
- SemiMoment SemiMoment

Distance matrices

Implements distance and cor_and_dist.

First order distance estimator with custom distance algorithm, and optional exponent Distance
Second order distance estimator with custom pairwise distance algorithm from Distances.jl, custom distance algorithm, and optional exponent DistanceDistance

The distance estimators are used together with various distance matrix algorithms.

Simple distance SimpleDistance
Simple absolute distance SimpleAbsoluteDistance
Logarithmic distance LogDistance
Correlation distance CorrelationDistance
Variation of Information distance with various binning algorithms VariationInfoDistance
- Bin-width-rule bins BinWidthBins
  - Knuth's optimal bin width Knuth
  - Freedman-Diaconis bin width FreedmanDiaconis
  - Scott's bin width Scott
- Hacine-Gharbi-Ravier bin width HacineGharbiRavier
- Predefined number of bins
Canonical distance CanonicalDistance

Phylogeny

PortfolioOptimisers.jl can make use of asset relationships to perform optimisations, define constraints, and compute relatedness characteristics of portfolios.

Clustering

Phylogeny constraints and clustering optimisations make use of clustering algorithms via ClustersEstimator, Clusters, and clusterise. Most clustering algorithms come from Clustering.jl.

Automatic choice of number of clusters via OptimalNumberClusters and VectorToScalarMeasure
- Second order difference SecondOrderDifference
- Silhouette scores SilhouetteScore
- Predefined number of clusters.

Hierarchical

Hierarchical clustering HClustAlgorithm
Direct Bubble Hierarchical Trees DBHT and Local Global sparsification of the covariance matrix LoGo, logo!, and logo

Non-hierarchical

Non-hierarchical clustering algorithms are incompatible with hierarchical clustering optimisations, but they can be used for phylogeny constraints and NestedClustered optimisations.

K-means clustering KMeansAlgorithm

Networks

Adjacency matrices

Adjacency matrices encode asset relationships either with clustering or graph theory via phylogeny_matrix and PhylogenyResult.

Network adjacency NetworkEstimator with custom tree algorithms, covariance, and distance estimators
- Minimum spanning trees KruskalTree, BoruvkaTree, PrimTree
- Triangulated Maximally Filtered Graph with various similarity matrix estimators
  - Maximum distance similarity MaximumDistanceSimilarity
  - Exponential similarity ExponentialSimilarity
  - General exponential similarity GeneralExponentialSimilarity
Clustering adjacency ClustersEstimator and Clusters

Centrality and phylogeny measures

Centrality estimator CentralityEstimator with custom adjacency matrix estimators (clustering and network) and centrality measures
- Betweenness BetweennessCentrality
- Closeness ClosenessCentrality
- Degree DegreeCentrality
- Eigenvector EigenvectorCentrality
- Katz KatzCentrality
- Pagerank Pagerank
- Radiality RadialityCentrality
- Stress StressCentrality
Centrality vector centrality_vector
Average centrality average_centrality
The asset phylogeny score asset_phylogeny

Optimisation constraints

Non-clustering optimisers support a wide range of constraints, while naive and clustering optimisers only support weight bounds. Furthermore, the entropy pooling prior supports a variety of view constraints. It is therefore important that users can generate constraints manually and/or programmatically. To that end, we provide a wide, robust, and extensible range of types, such as AbstractEstimatorValueAlgorithm and UniformValues, and functions that make this easy, fast, and safe.

Constraints can be defined via their estimators or directly by their result types. Some estimators need to map key-value pairs to the asset universe; this is done by defining the assets and asset groups in AssetSets. Internally, PortfolioOptimisers.jl uses all of this information and calls group_to_val! and replace_group_by_assets to produce the appropriate arrays.

Equation parsing parse_equation and ParsingResult
Linear constraints linear_constraints, LinearConstraintEstimator, PartialLinearConstraint, and LinearConstraint
Risk budgeting constraints risk_budget_constraints, RiskBudgetEstimator, and RiskBudget
Phylogeny constraints phylogeny_constraints, centrality_constraints, SemiDefinitePhylogenyEstimator, SemiDefinitePhylogeny, IntegerPhylogenyEstimator, IntegerPhylogeny, CentralityConstraint
Weight bounds constraints weight_bounds_constraints, WeightBoundsEstimator, WeightBounds
Asset set matrices asset_sets_matrix and AssetSetsMatrixEstimator
Threshold constraints threshold_constraints, ThresholdEstimator, and Threshold

Prior statistics

Many optimisations and constraints use prior statistics computed via prior.

Low order prior LowOrderPrior
- Empirical EmpiricalPrior
- Factor model FactorPrior
- Black-Litterman
  - Vanilla BlackLittermanPrior
  - Bayesian BayesianBlackLittermanPrior
  - Factor model FactorBlackLittermanPrior
  - Augmented AugmentedBlackLittermanPrior
- Entropy pooling EntropyPoolingPrior
- Opinion pooling OpinionPoolingPrior
High order prior HighOrderPrior
- High order HighOrderPriorEstimator
- High order factor model HighOrderFactorPriorEstimator

Uncertainty sets

In order to make optimisations more robust to noise and measurement error, it is possible to define uncertainty sets on the expected returns and covariance. These can be used in optimisations which use either of these two quantities. These are implemented via ucs, mu_ucs, and sigma_ucs.

PortfolioOptimisers.jl implements two types of uncertainty sets.

BoxUncertaintySet and BoxUncertaintySetAlgorithm
EllipsoidalUncertaintySet and EllipsoidalUncertaintySetAlgorithm with various algorithms for computing the scaling parameter via k_ucs
- NormalKUncertaintyAlgorithm
- GeneralKUncertaintyAlgorithm
- ChiSqKUncertaintyAlgorithm
- Predefined scaling parameter

It also implements various estimators for the uncertainty sets. The following two can generate both box and ellipsoidal sets.

Normally distributed returns NormalUncertaintySet
Bootstrapping via Autoregressive Conditional Heteroscedasticity ARCHUncertaintySet via arch
- Circular CircularBootstrap
- Moving MovingBootstrap
- Stationary StationaryBootstrap

The following estimator can only generate box sets.

DeltaUncertaintySet

Turnover

The turnover is defined as the element-wise absolute difference between the vector of current weights and a vector of benchmark weights. It can be used as a constraint, as a method for fee calculation, and as a risk measure. These are all implemented using turnover_constraints, TurnoverEstimator, and Turnover.
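
The definition of turnover can be written in a couple of lines of plain Julia. This is only a sketch of the arithmetic with illustrative numbers; it does not use TurnoverEstimator or Turnover, and the proportional fee rate is a hypothetical value.

```julia
# Illustrative weights; the values are made up for this example.
w  = [0.40, 0.35, 0.25]   # current weights
wb = [0.50, 0.30, 0.20]   # benchmark weights

# Element-wise absolute difference, as in the definition of turnover.
turnover = abs.(w .- wb)

# Aggregate turnover, e.g. for use in a constraint or as a risk value.
total_turnover = sum(turnover)

# Hypothetical proportional fee charged per unit of turnover.
fee_rate = 0.001
turnover_fee = fee_rate * total_turnover
```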

Fees

Fees are a non-negligible aspect of active investing. As such, PortfolioOptimisers.jl can account for them in all optimisations except the naive ones. They can also be used to adjust expected returns calculations via calc_fees and calc_asset_fees.

Fees FeesEstimator and Fees
- Proportional long
- Proportional short
- Fixed long
- Fixed short
- Turnover

Portfolio returns and drawdowns

Various risk measures and analyses require the computation of simple and cumulative portfolio returns and drawdowns, both in aggregate and per asset. These are computed by calc_net_returns, calc_net_asset_returns, cumulative_returns, and drawdowns.
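
For orientation, the quantities named above can be sketched in plain Julia. This does not call the package functions; the additive and compounded conventions shown here are assumptions made for the example, and fees are ignored.

```julia
# T×N matrix of asset returns (rows are observations) and portfolio weights.
X = [0.10 0.00; -0.20 0.10; 0.05 0.05]
w = [0.5, 0.5]

# Portfolio returns per observation.
port = X * w

# Additive convention: cumulative returns are a running sum,
# drawdowns are the distance below the running peak.
cum_simple = cumsum(port)
dd_simple = cum_simple .- accumulate(max, cum_simple)

# Compounded convention: cumulative wealth is a running product,
# drawdowns are the relative loss from the running peak.
cum_comp = cumprod(1 .+ port)
dd_comp = cum_comp ./ accumulate(max, cum_comp) .- 1
```

Drawdowns are zero at a new peak and negative otherwise under both conventions.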

Tracking

It is often useful to create portfolios that track the performance of an index, indicator, or another portfolio.

Tracking error tracking_benchmark, TrackingError
- Returns tracking ReturnsTracking
- Weights tracking WeightsTracking

The error can be computed using different algorithms using norm_error.

Norm tracking algorithms
- L1-norm L1Norm
- L2-norm L2Norm
- L2-norm squared SquaredL2Norm
- Lp-norm LpNorm
- L-Inf-norm LInfNorm
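
The norms listed above can be sketched with the LinearAlgebra standard library. This only shows the underlying arithmetic on made-up numbers; any scaling by the number of observations that the package may apply is not shown here.

```julia
using LinearAlgebra

# Illustrative portfolio and benchmark returns; the values are made up.
portfolio = [0.02, -0.01, 0.03]
benchmark = [0.01, 0.01, 0.00]
d = portfolio .- benchmark

err_l1 = norm(d, 1)       # sum of absolute deviations
err_l2 = norm(d, 2)       # Euclidean norm
err_l2sq = sum(abs2, d)   # squared Euclidean norm
err_lp = norm(d, 3)       # general p-norm, here with p = 3
err_linf = norm(d, Inf)   # largest absolute deviation
```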

It is also possible to track the error with risk measures via RiskTrackingError using WeightsTracking, which allows for two approaches.

Dependent variable tracking DependentVariableTracking
Independent variable tracking IndependentVariableTracking

Risk measures

PortfolioOptimisers.jl provides a wide range of risk measures. These are broadly categorised into two types based on the type of optimisations that support them.

Risk measures for traditional optimisation

These are all subtypes of RiskMeasure, and are supported by all optimisation estimators.

Variance Variance
- Traditional optimisations also support:
  - Risk contribution
  - Formulations
    Quadratic risk expression QuadRiskExpr
    Squared second order cone SquaredSOCRiskExpr
Standard deviation StandardDeviation
Uncertainty set variance UncertaintySetVariance (same as variance when used in non-traditional optimisation)
Low order moments LowOrderMoment
- First lower moment FirstLowerMoment
- Mean absolute deviation MeanAbsoluteDeviation
- Second moment SecondMoment
  - Second squared moments
    Scenario variance FullMoment
    Scenario semi-variance SemiMoment
    Traditional optimisation formulations
    Quadratic risk expression QuadRiskExpr
    Squared second order cone SquaredSOCRiskExpr
    Rotated second order cone RSOCRiskExpr
  - Second moments SOCRiskExpr
    Scenario standard deviation FullMoment
    Scenario semi-standard deviation SemiMoment
Kurtosis Kurtosis
- Actual kurtosis
  - Full and semi-kurtosis are supported in traditional optimisers via the kt field. Risk calculation uses
    FullMoment FullMoment
    SemiMoment SemiMoment
  - Traditional optimisation formulations
    Quadratic risk expression QuadRiskExpr
    Squared second order cone SquaredSOCRiskExpr
    Rotated second order cone RSOCRiskExpr
- Square root kurtosis SOCRiskExpr
  - FullMoment FullMoment
  - SemiMoment SemiMoment
Negative skewness NegativeSkewness
- Squared negative skewness
  - Full and semi-skewness are supported in traditional optimisers via the sk and V fields. Risk calculation uses
    FullMoment FullMoment
    SemiMoment SemiMoment
  - Traditional optimisation formulations
    Quadratic risk expression QuadRiskExpr
    Squared second order cone SquaredSOCRiskExpr
  - Square root negative skewness SOCRiskExpr
Value at Risk ValueatRisk
- Traditional optimisation formulations
  - Exact MIP formulation MIPValueatRisk
  - Approximate distribution based DistributionValueatRisk
Value at Risk Range ValueatRiskRange
- Traditional optimisation formulations
  - Exact MIP formulation MIPValueatRisk
  - Approximate distribution based DistributionValueatRisk
Drawdown at Risk DrawdownatRisk
Conditional Value at Risk ConditionalValueatRisk
Distributionally Robust Conditional Value at Risk DistributionallyRobustConditionalValueatRisk (same as conditional value at risk when used in non-traditional optimisation)
Conditional Value at Risk Range ConditionalValueatRiskRange
Distributionally Robust Conditional Value at Risk Range DistributionallyRobustConditionalValueatRiskRange (same as conditional value at risk range when used in non-traditional optimisation)
Conditional Drawdown at Risk ConditionalDrawdownatRisk
Distributionally Robust Conditional Drawdown at Risk DistributionallyRobustConditionalDrawdownatRisk (same as conditional drawdown at risk when used in non-traditional optimisation)
Entropic Value at Risk EntropicValueatRisk
Entropic Value at Risk Range EntropicValueatRiskRange
Entropic Drawdown at Risk EntropicDrawdownatRisk
Relativistic Value at Risk RelativisticValueatRisk
Relativistic Value at Risk Range RelativisticValueatRiskRange
Relativistic Drawdown at Risk RelativisticDrawdownatRisk
Ordered Weights Array
- Risk measures
  - Ordered Weights Array risk measure OrderedWeightsArray
  - Ordered Weights Array range risk measure OrderedWeightsArrayRange
- Traditional optimisation formulations
  - Exact ExactOrderedWeightsArray
  - Approximate ApproxOrderedWeightsArray
- Array functions
  - Gini Mean Difference owa_gmd
  - Worst Realisation owa_wr
  - Range owa_rg
  - Conditional Value at Risk owa_cvar
  - Weighted Conditional Value at Risk owa_wcvar
  - Conditional Value at Risk Range owa_cvarrg
  - Weighted Conditional Value at Risk Range owa_wcvarrg
  - Tail Gini owa_tg
  - Tail Gini Range owa_tgrg
  - Linear moments (L-moments)
    Linear Moment owa_l_moment
    Linear Moment Convex Risk Measure owa_l_moment_crm
    L-moment combination formulations
    Maximum Entropy MaximumEntropy
    Exponential Cone Entropy ExponentialConeEntropy
    Relative Entropy RelativeEntropy
    Minimum Squared Distance MinimumSquaredDistance
    Minimum Sum Squares MinimumSumSquares
Average Drawdown AverageDrawdown
Ulcer Index UlcerIndex
Maximum Drawdown MaximumDrawdown
Brownian Distance Variance BrownianDistanceVariance
- Traditional optimisation formulations
  - Distance matrix constraint formulations
    Norm one cone Brownian distance variance NormOneConeBrownianDistanceVariance
    Inequality Brownian distance variance IneqBrownianDistanceVariance
  - Risk formulation
    Quadratic risk expression QuadRiskExpr
    Rotated second order cone RSOCRiskExpr
Worst Realisation WorstRealisation
Range Range
Turnover Risk Measure TurnoverRiskMeasure
Tracking Risk Measure TrackingRiskMeasure
- L1-norm L1Norm
- L2-norm L2Norm
- L2-norm squared SquaredL2Norm
- Lp-norm LpNorm
- L-Inf-norm LInfNorm
Risk Tracking Risk Measure
- Dependent variable tracking DependentVariableTracking
- Independent variable tracking IndependentVariableTracking
Power Norm Value at Risk PowerNormValueatRisk
Power Norm Value at Risk Range PowerNormValueatRiskRange
Power Norm Drawdown at Risk PowerNormDrawdownatRisk

Risk measures for hierarchical optimisation

These are all subtypes of HierarchicalRiskMeasure, and are only supported by hierarchical optimisation estimators.

High order moment HighOrderMoment
- Unstandardised third lower moment ThirdLowerMoment
- Standardised third lower moment StandardisedHighOrderMoment and ThirdLowerMoment
- Unstandardised fourth moment FourthMoment
  - FullMoment FullMoment
  - SemiMoment SemiMoment
- Standardised fourth moment StandardisedHighOrderMoment and FourthMoment
  - FullMoment FullMoment
  - SemiMoment SemiMoment
Relative Drawdown at Risk RelativeDrawdownatRisk
Relative Conditional Drawdown at Risk RelativeConditionalDrawdownatRisk
Relative Entropic Drawdown at Risk RelativeEntropicDrawdownatRisk
Relative Relativistic Drawdown at Risk RelativeRelativisticDrawdownatRisk
Relative Average Drawdown RelativeAverageDrawdown
Relative Ulcer Index RelativeUlcerIndex
Relative Maximum Drawdown RelativeMaximumDrawdown
Relative Power Norm Drawdown at Risk RelativePowerNormDrawdownatRisk
Risk Ratio Risk Measure RiskRatio
Equal Risk Measure EqualRisk
Median Absolute Deviation MedianAbsoluteDeviation

Non-optimisation risk measures

These risk measures are unsuitable for optimisation because they can return negative values. However, they can be used for performance metrics.

Mean Return MeanReturn
Third Central Moment [ThirdCentralMoment]-(@ref)
Skewness Skewness
Return Risk Measure ExpectedReturn
Return Risk Ratio Risk Measure ExpectedReturnRiskRatio

Performance metrics

Expected risk expected_risk
Number of effective assets number_effective_assets
Risk contribution
- Asset risk contribution risk_contribution
- Factor risk contribution factor_risk_contribution
Expected return expected_return
- Arithmetic ArithmeticReturn
- Logarithmic LogarithmicReturn
Expected risk-adjusted return ratio expected_ratio and expected_risk_ret_ratio
Expected risk-adjusted ratio information criterion expected_sric and expected_risk_ret_sric
Brinson performance attribution brinson_attribution

Portfolio optimisation

Optimisations are implemented via optimise. Optimisations consume an estimator and return a result.

Naive

These return a NaiveOptimisationResult.

Inverse Volatility InverseVolatility
Equal Weighted EqualWeighted
Random (Dirichlet) RandomWeighted

Naive optimisation features

Weight bounds WeightBoundsEstimator, UniformValues, and WeightBounds
Weight finalisers
- Iterative Weight Finaliser IterativeWeightFinaliser
- JuMP Weight Finaliser JuMPWeightFinaliser
  - Relative Error Weight Finaliser RelativeErrorWeightFinaliser
  - Squared Relative Error Weight Finaliser SquaredRelativeErrorWeightFinaliser
  - Absolute Error Weight Finaliser AbsoluteErrorWeightFinaliser
  - Squared Absolute Error Weight Finaliser SquaredAbsoluteErrorWeightFinaliser

Traditional

These optimisations are implemented as JuMP problems and make use of JuMPOptimiser, which encodes all supported constraints.

Objective function optimisations

These optimisations support a variety of objective functions.

Objective functions
- Minimum risk MinimumRisk
- Maximum utility MaximumUtility
- Maximum return over risk ratio MaximumRatio
- Maximum return MaximumReturn
Exclusive to MeanRisk and NearOptimalCentering
- N-dimensional Pareto fronts Frontier
  - Return based
  - Risk based
Optimisation estimators
- Mean-Risk MeanRisk returns a MeanRiskResult
- Near Optimal Centering NearOptimalCentering returns a NearOptimalCenteringResult
- Factor Risk Contribution FactorRiskContribution returns a FactorRiskContributionResult

Risk budgeting optimisations

These optimisations attempt to achieve weight values according to a risk budget vector. This vector can be provided on a per asset or per factor basis.

Budget targets
- Asset risk budgeting AssetRiskBudgeting
- Formulations
  - Log-barrier risk budgeting LogRiskBudgeting
  - MIP asset risk budgeting MixedIntegerRiskBudgeting
- Factor risk budgeting FactorRiskBudgeting
Optimisation estimators
- Risk Budgeting RiskBudgeting returns a RiskBudgetingResult
- Relaxed Risk Budgeting RelaxedRiskBudgeting returns a RiskBudgetingResult
  - Basic BasicRelaxedRiskBudgeting
  - Regularised RegularisedRelaxedRiskBudgeting
  - Regularised and penalised RegularisedPenalisedRelaxedRiskBudgeting

Traditional optimisation features

Custom objective penalty CustomJuMPObjective
Weight bounds WeightBoundsEstimator, UniformValues, and WeightBounds
Budget
- Directionality
  - Long
  - Short
- Type
  - Exact
  - Range BudgetRange
Threshold ThresholdEstimator and Threshold
- Directionality
  - Long
  - Short
- Type
  - Asset
  - Set AssetSetsMatrixEstimator
Linear constraints LinearConstraintEstimator and LinearConstraint
Centralit(y/ies) CentralityEstimator
Cardinality
- Asset
- Asset group(s) LinearConstraintEstimator and LinearConstraint
- Set(s)
- Set group(s) LinearConstraintEstimator and LinearConstraint
Turnover(s) TurnoverEstimator and Turnover
Fees FeesEstimator and Fees
Tracking error(s) TrackingError
Phylogen(y/ies) IntegerPhylogenyEstimator and SemiDefinitePhylogenyEstimator
Portfolio returns
- Arithmetic returns ArithmeticReturn
  - Uncertainty set BoxUncertaintySet, BoxUncertaintySetAlgorithm, EllipsoidalUncertaintySet, and EllipsoidalUncertaintySetAlgorithm
  - Custom expected returns vector
- Logarithmic returns LogarithmicReturn
Risk vector scalarisation
- Weighted sum SumScalariser
- Maximum value MaxScalariser
- Log-sum-exp LogSumExpScalariser
Custom constraint
Number of effective assets
Regularisation penalty
- L1
- L2
- Lp LpRegularisation
- L-Inf
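
The three risk vector scalarisation options listed above (weighted sum, maximum value, log-sum-exp) can be sketched as follows. This is a sketch of the general idea only: the exact formulas used by SumScalariser, MaxScalariser, and LogSumExpScalariser are not given here, and the gamma parameter and function name are assumptions for the example.

```julia
# Illustrative risk values and weights for three risk measures.
risks = [0.10, 0.20, 0.15]
weights = [1.0, 2.0, 1.0]
scaled = weights .* risks

# Weighted sum: every risk measure contributes.
risk_sum = sum(scaled)

# Maximum value: only the largest weighted risk matters.
risk_max = maximum(scaled)

# Log-sum-exp: a smooth upper bound on the maximum.
# gamma is a hypothetical sharpness parameter; larger values approach the maximum.
function logsumexp_scalarise(v::AbstractVector{<:Real}, gamma::Real)
    m = maximum(gamma .* v)                    # shift for numerical stability
    return (m + log(sum(exp.(gamma .* v .- m)))) / gamma
end

risk_lse_soft = logsumexp_scalarise(scaled, 1.0)
risk_lse_sharp = logsumexp_scalarise(scaled, 1e3)
```

With a large gamma the log-sum-exp value is close to the maximum, while small values spread the emphasis across all risk measures.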

Clustering optimisation

Clustering optimisations use asset relationships to minimise risk exposure by breaking the asset universe into subsets, which are optimised either hierarchically or individually.

Hierarchical clustering optimisation

These optimisations minimise risk by hierarchically splitting the asset universe into subsets, computing the risk of each subset, and combining them according to their hierarchy.

Hierarchical Risk Parity HierarchicalRiskParity returns a HierarchicalResult
Hierarchical Equal Risk Contribution HierarchicalEqualRiskContribution returns a HierarchicalResult

Hierarchical clustering optimisation features

Weight bounds WeightBoundsEstimator, UniformValues, and WeightBounds
Fees FeesEstimator and Fees
Risk vector scalarisation
- Weighted sum SumScalariser
- Maximum value MaxScalariser
- Log-sum-exp LogSumExpScalariser
Weight finalisers
- Iterative Weight Finaliser IterativeWeightFinaliser
- JuMP Weight Finaliser JuMPWeightFinaliser
  - Relative Error Weight Finaliser RelativeErrorWeightFinaliser
  - Squared Relative Error Weight Finaliser SquaredRelativeErrorWeightFinaliser
  - Absolute Error Weight Finaliser AbsoluteErrorWeightFinaliser
  - Squared Absolute Error Weight Finaliser SquaredAbsoluteErrorWeightFinaliser

Schur complementary optimisation

Schur complementary hierarchical risk parity provides a bridge between mean-variance optimisation and hierarchical risk parity by using an interpolation parameter. Depending on the value of this parameter, it converges to hierarchical risk parity or approximates mean-variance optimisation. It uses the Schur complement to adjust the weights of a portfolio according to how much more useful information is gained by assigning more weight to a group of assets.

Schur Complementary Hierarchical Risk Parity SchurComplementHierarchicalRiskParity returns a SchurComplementHierarchicalRiskParityResult

Schur complementary optimisation features

Weight bounds WeightBoundsEstimator, UniformValues, and WeightBounds
Fees FeesEstimator and Fees
Weight finalisers
- Iterative Weight Finaliser IterativeWeightFinaliser
- JuMP Weight Finaliser JuMPWeightFinaliser
  - Relative Error Weight Finaliser RelativeErrorWeightFinaliser
  - Squared Relative Error Weight Finaliser SquaredRelativeErrorWeightFinaliser
  - Absolute Error Weight Finaliser AbsoluteErrorWeightFinaliser
  - Squared Absolute Error Weight Finaliser SquaredAbsoluteErrorWeightFinaliser

Nested clusters optimisation

Nested clustered optimisation breaks the asset universe of size N into C smaller subsets and treats every subset as an individual portfolio. The weights assigned to each asset are placed in an N × C matrix. In each column, the non-zero values correspond to the assets assigned to that subset, so each asset only contributes to the column (and therefore the synthetic asset) of its assigned subset. In other words, each row of the matrix contains a single non-zero value and each column contains as many non-zero values as there are assets in that subset.

From here there are two options:

1. Compute the returns matrix of the synthetic assets directly by multiplying the original T × N matrix by the N × C matrix of asset weights to produce a T × C matrix of predicted returns, where T is the number of observations.

2. For each subset, perform a cross-validation prediction, yielding a vector of returns for that subset. These vectors are then horizontally concatenated into a Y × C matrix of cross-validation predicted returns, where Y ≤ T because the cross-validation may not use the full history.

This matrix of predicted returns is then used by the outer optimisation estimator to generate an optimisation of the synthetic assets. This produces a C × 1 vector, essentially optimising a portfolio of asset clusters. The final weights are the product of the original N × C matrix of asset weights per cluster by the C × 1 vector of optimal synthetic asset weights to produce the final N × 1 vector of asset weights.
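
The matrix algebra described above, using the direct (non-cross-validated) route to the synthetic returns, can be sketched in plain Julia. The cluster assignment and the inner and outer weights below are made-up stand-ins for what the clustering step and the inner and outer estimators would produce.

```julia
# N = 4 assets, C = 2 clusters, T = 3 observations (illustrative sizes).
N, C = 4, 2
clusters = [1, 1, 2, 2]                   # cluster assignment per asset (made up)
inner_weights = [[0.6, 0.4], [0.3, 0.7]]  # stand-in for the inner optimisation results

# N×C weights matrix: each asset only contributes to the column of its own cluster.
W = zeros(N, C)
for c in 1:C
    idx = findall(==(c), clusters)
    W[idx, c] .= inner_weights[c]
end

# T×N asset returns times N×C weights gives T×C synthetic asset returns.
X = [ 0.01  0.02  0.00 -0.01;
      0.00 -0.01  0.02  0.01;
      0.03  0.01 -0.02  0.00]
S = X * W

# Stand-in for the outer optimisation result: a C×1 vector of cluster weights.
w_outer = [0.5, 0.5]

# Final N×1 asset weights.
w_final = W * w_outer
```

Since every column of W and the outer weights each sum to one, the final asset weights also sum to one.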

Nested Clustered NestedClustered returns a NestedClusteredResult

Nested clusters optimisation features

Any features supported by the inner and outer estimators.
Weight bounds WeightBoundsEstimator, UniformValues, and WeightBounds
Fees FeesEstimator and Fees
Weight finalisers
- Iterative Weight Finaliser IterativeWeightFinaliser
- JuMP Weight Finaliser JuMPWeightFinaliser
  - Relative Error Weight Finaliser RelativeErrorWeightFinaliser
  - Squared Relative Error Weight Finaliser SquaredRelativeErrorWeightFinaliser
  - Absolute Error Weight Finaliser AbsoluteErrorWeightFinaliser
  - Squared Absolute Error Weight Finaliser SquaredAbsoluteErrorWeightFinaliser
Cross validation predictor for the outer estimator

Ensemble optimisation

This works similarly to the Nested Clustered estimator, only instead of breaking the asset universe into subsets, a list of inner estimators is provided. The procedure is then exactly the same as the nested clusters optimisation, only instead of an N × C matrix of asset weights where each column corresponds to a subset of assets, each column corresponds to a completely independent and isolated inner estimator, which also means there is no enforced sparsity pattern on this matrix.

Stacking Stacking returns a StackingResult

Ensemble optimisation features

Any features supported by the inner and outer estimators.
Fees FeesEstimator and Fees
Weight bounds WeightBoundsEstimator, UniformValues, and WeightBounds
Weight finalisers
- Iterative Weight Finaliser IterativeWeightFinaliser
- JuMP Weight Finaliser JuMPWeightFinaliser
  - Relative Error Weight Finaliser RelativeErrorWeightFinaliser
  - Squared Relative Error Weight Finaliser SquaredRelativeErrorWeightFinaliser
  - Absolute Error Weight Finaliser AbsoluteErrorWeightFinaliser
  - Squared Absolute Error Weight Finaliser SquaredAbsoluteErrorWeightFinaliser
Cross validation predictor for the outer estimator

Subset resampling optimisation

This optimiser takes ideas from MultipleRandomised cross validation to randomly sample the asset universe and optimise each sample individually using a given optimiser. The final asset weights are the average weight per asset across all samples. If an asset does not appear in a sample, its weight in that sample is taken to be zero.
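The averaging step can be sketched as follows. This is a minimal illustration under stated assumptions: the inner optimiser is replaced by a stand-in that returns equal weights, and the function names `equal_weights` and `subset_resample` are hypothetical, not part of the package.

```julia
using Random

# Stand-in for the inner optimiser (assumption): equal weights over the sample.
equal_weights(n) = fill(1 / n, n)

function subset_resample(N, n_samples, subset_size; rng = Random.default_rng())
    acc = zeros(N)
    for _ in 1:n_samples
        # Draw a random subset of assets without replacement.
        idx = randperm(rng, N)[1:subset_size]
        # Assets that are not in the sample keep a weight of zero.
        w = zeros(N)
        w[idx] .= equal_weights(subset_size)
        acc .+= w
    end
    # Average weight per asset across all samples.
    return acc ./ n_samples
end

w = subset_resample(6, 100, 3; rng = MersenneTwister(1))
```

Since every sample's weights sum to one, the averaged weights do too.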

SubsetResampling returns a SubsetResamplingResult

Subset resampling optimisation features

Any features supported by the inner estimator.
Fees FeesEstimator and Fees
Weight bounds WeightBoundsEstimator, UniformValues, and WeightBounds
Weight finalisers
- Iterative Weight Finaliser IterativeWeightFinaliser
- JuMP Weight Finaliser JuMPWeightFinaliser
  - Relative Error Weight Finaliser RelativeErrorWeightFinaliser
  - Squared Relative Error Weight Finaliser SquaredRelativeErrorWeightFinaliser
  - Absolute Error Weight Finaliser AbsoluteErrorWeightFinaliser
  - Squared Absolute Error Weight Finaliser SquaredAbsoluteErrorWeightFinaliser

Finite allocation optimisation

Unlike all other estimators, finite allocation does not yield an "optimal" value, but rather the optimal attainable solution based on a finite amount of capital. Finite allocation estimators use the result of other estimations, the latest prices, and a cash amount.
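To see why the attainable solution differs from the optimal weights, consider the naive rounding sketch below. It is not the algorithm used by DiscreteAllocation or GreedyAllocation; it simply buys whole shares under each asset's target allocation, with made-up weights, prices, and cash.

```julia
# Assumed inputs: target weights from a previous optimisation,
# latest price per share, and available cash.
w = [0.5, 0.3, 0.2]
prices = [120.0, 45.0, 310.0]
cash = 1_000.0

# Whole shares affordable within each asset's target allocation.
shares = floor.(Int, w .* cash ./ prices)

# Capital actually invested per asset, and what is left over.
spent = shares .* prices
leftover = cash - sum(spent)

# Weights actually realised by the purchased shares.
w_realised = spent ./ sum(spent)
```

Here the third asset costs more per share than its target allocation, so it receives no shares and the realised weights drift away from the targets. Dedicated finite allocation methods aim to reduce this kind of gap.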

Discrete (MIP) DiscreteAllocation
- Weight finalisers
  - Iterative Weight Finaliser IterativeWeightFinaliser
  - JuMP Weight Finaliser JuMPWeightFinaliser
    - Relative Error Weight Finaliser RelativeErrorWeightFinaliser
    - Squared Relative Error Weight Finaliser SquaredRelativeErrorWeightFinaliser
    - Absolute Error Weight Finaliser AbsoluteErrorWeightFinaliser
    - Squared Absolute Error Weight Finaliser SquaredAbsoluteErrorWeightFinaliser
Greedy GreedyAllocation

Cross validation

Prediction on unseen data PredictionReturnsResult, PredictionResult, MultiPeriodPredictionResult, PopulationPredictionResult via predict(res::NonFiniteAllocationOptimisationResult, rd::ReturnsResult), fit_and_predict
Prediction scoring via PredictionCrossValScorer, NearestQuantilePrediction, and quantile_by_measure
Cross validation estimators used via split and fit_and_predict
- K-Fold KFold returns a KFoldResult
- Combinatorial CombinatorialCrossValidation returns a CombinatorialCrossValidationResult
- Walk forward WalkForwardEstimator returns a WalkForwardResult
  - Index-based IndexWalkForward, DateWalkForward
- Multiple randomised MultipleRandomised returns a MultipleRandomisedResult
Hyperparameter tuning via search_cross_validation.
- Grid search cross validation GridSearchCrossValidation
- Randomised search cross validation RandomisedSearchCrossValidation
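For readers unfamiliar with the idea, a contiguous K-fold split over T observations can be sketched as below. This is a generic illustration, not the package's `split` implementation; the function name `kfold_indices` and the contiguous, unshuffled fold layout are assumptions.

```julia
# Split observations 1:T into k contiguous test folds; the remaining
# observations in each case form the training set.
function kfold_indices(T, k)
    # Fold boundaries, e.g. T = 10, k = 3 gives [0, 3, 7, 10].
    bounds = round.(Int, range(0, T; length = k + 1))
    return [(train = vcat(1:bounds[i], (bounds[i + 1] + 1):T),
             test = collect((bounds[i] + 1):bounds[i + 1])) for i in 1:k]
end

folds = kfold_indices(10, 3)
```

Each element of `folds` is a named tuple of training and test indices. A model would be fitted on `train` and evaluated on `test` for every fold.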

Plotting

Visualising the results is a useful way of summarising a portfolio's characteristics or evolution. To this end, we provide a few plotting functions, with more to come.

Simple or compound cumulative returns.
- Portfolio plot_ptf_cumulative_returns.
- Assets plot_asset_cumulative_returns.
Portfolio composition.
- Single portfolio plot_composition.
- Multi portfolio.
  - Stacked bar plot_stacked_bar_composition.
  - Stacked area plot_stacked_area_composition.
Risk contribution.
- Asset risk contribution plot_risk_contribution.
- Factor risk contribution plot_factor_risk_contribution.
Asset dendrogram plot_dendrogram.
Asset clusters + optional dendrogram plot_clusters.
Simple or compound drawdowns plot_drawdowns.
Portfolio returns histogram + density plot_histogram.
2/3D risk measure scatter plots plot_measures.

[1] Except for a few cases, most of which are convenience function overloads. This means some links do not go to the exact method definition. Other than hard-coding links to specific lines of code, which is fragile, I haven't found an easy solution.
