CWE-41

Improper Resolution of Path Equivalence

The product is vulnerable to file system contents disclosure through path equivalence. Path equivalence involves the use of special characters in file and directory names. The associated manipulations are intended to generate multiple names for the same object.
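To make "multiple names for the same object" concrete, the following minimal sketch collapses several spellings of one path into a single canonical form. The paths are hypothetical. `posixpath.normpath` performs purely lexical normalization, so it does not account for symbolic links or case-insensitive file systems.

```python
import posixpath

# Hypothetical spellings that all refer to the same file on a POSIX system.
variants = [
    "/var/www/secret.txt",
    "/var/www/./secret.txt",          # redundant "." segment
    "/var/www//secret.txt",           # doubled separator
    "/var/www/sub/../secret.txt",     # detour through a subdirectory
    "/var/www/secret.txt/",           # trailing separator
]

# Lexical normalization maps every variant to one canonical string.
canonical = {posixpath.normpath(p) for p in variants}
print(canonical)  # {'/var/www/secret.txt'}
```

Any check that compares only the first spelling would treat the other four as different objects.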

Status

incomplete

Abstraction

base

Extended Description

Path equivalence is usually employed in order to circumvent access controls expressed using an incomplete set of file name or file path representations. This is different from path traversal, wherein the manipulations are performed to generate a name for a different object.
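The bypass described above can be sketched as follows. The protected path and both function names are hypothetical. The first check compares the raw request against a single spelling, and the second compares a normalized form. Lexical normalization is only one part of a real defense, since it ignores symbolic links and platform-specific equivalences.

```python
import posixpath

# Hypothetical resource that the access control is meant to protect.
PROTECTED = {"/admin/config.ini"}

def naive_is_blocked(requested: str) -> bool:
    # Flawed: only the exact spelling stored in PROTECTED is recognized.
    return requested in PROTECTED

def canonical_is_blocked(requested: str) -> bool:
    # Normalize first so that equivalent spellings collapse to one name.
    return posixpath.normpath(requested) in PROTECTED

request = "/admin/./config.ini"       # same object, different name
print(naive_is_blocked(request))      # False: the check is circumvented
print(canonical_is_blocked(request))  # True: the equivalent name is caught
```

The request still names the protected object, which is what separates this from path traversal, where the manipulated name points at a different object.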

Technical Details

Common Consequences

• confidentiality
• integrity
• access control

Impacts

• read files or directories
• modify files or directories
• bypass protection mechanism

Detection Methods

• automated static analysis - binary or bytecode
• manual static analysis - binary or bytecode
• dynamic analysis with automated results interpretation
• dynamic analysis with manual results interpretation
• manual static analysis - source code
• automated static analysis - source code
• architecture or design review

Potential Mitigations

Phases:

implementation

Descriptions:

• Assume all input is malicious. Use an "accept known good" input validation strategy, i.e., use a list of acceptable inputs that strictly conform to specifications. Reject any input that does not strictly conform to specifications, or transform it into something that does. When performing input validation, consider all potentially relevant properties, including length, type of input, the full range of acceptable values, missing or extra inputs, syntax, consistency across related fields, and conformance to business rules. As an example of business rule logic, "boat" may be syntactically valid because it only contains alphanumeric characters, but it is not valid if the input is only expected to contain colors such as "red" or "blue." Do not rely exclusively on looking for malicious or malformed inputs. This is likely to miss at least one undesirable input, especially if the code's environment changes. This can give attackers enough room to bypass the intended validation. However, denylists can be useful for detecting potential attacks or determining which inputs are so malformed that they should be rejected outright.

• Inputs should be decoded and canonicalized to the application's current internal representation before being validated (CWE-180). Make sure that the application does not decode the same input twice (CWE-174). Such errors could be used to bypass allowlist validation schemes by introducing dangerous inputs after they have been checked.

• Use and specify an output encoding that can be handled by the downstream component that is reading the output. Common encodings include ISO-8859-1, UTF-7, and UTF-8. When an encoding is not specified, a downstream component may choose a different encoding, either by assuming a default encoding or automatically inferring which encoding is being used, which can be erroneous. When the encodings are inconsistent, the downstream component might treat some character or byte sequences as special, even if they are not special in the original encoding. Attackers might then be able to exploit this discrepancy and conduct injection attacks; they even might be able to bypass protection mechanisms that assume the original encoding is also being used by the downstream component.
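The first two mitigations above can be combined into one short sketch: decode the input exactly once, canonicalize it, and only then compare it against an allowlist. `BASE_DIR`, `ALLOWED`, and `resolve_request` are hypothetical names chosen for illustration, and the normalization is lexical only, so a real deployment would also need to consider symbolic links and file-system case rules.

```python
import posixpath
from typing import Optional
from urllib.parse import unquote

# Hypothetical document root and allowlist of canonical paths.
BASE_DIR = "/srv/files"
ALLOWED = {"/srv/files/report.pdf", "/srv/files/logo.png"}

def resolve_request(raw: str) -> Optional[str]:
    # 1. Decode exactly once; a second pass would reintroduce
    #    dangerous sequences after validation (double decoding).
    decoded = unquote(raw)
    # 2. Canonicalize relative to the base directory.
    candidate = posixpath.normpath(
        posixpath.join(BASE_DIR, decoded.lstrip("/"))
    )
    # 3. Validate the canonical form against known-good values.
    return candidate if candidate in ALLOWED else None

print(resolve_request("report.pdf"))             # /srv/files/report.pdf
print(resolve_request("%2e/report.pdf"))         # /srv/files/report.pdf
print(resolve_request("../../etc/passwd"))       # None
print(resolve_request("%252e%252e/etc/passwd"))  # None
```

Because the decision is made on the canonical form, equivalent spellings of an allowed file receive the same answer, and a double-encoded sequence stays literal after the single decode and fails the allowlist check.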

Functional Areas

file processing

