Fails to load PE binaries with non-utf-8 decodable bytes in section name #438

AlexVanMechelen · 2023-11-08T19:30:04Z

Description

Loading a PE binary with non-utf-8 decodable bytes in the section name of one of its sections causes a crash here

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd2 in position 6: invalid continuation byte

Could add errors='ignore' flag to .decode() to drop non-utf-8 decodable bytes

ltfish · 2023-11-08T19:57:49Z

@rhelmot I've been thinking about this for a while. Should we use bytes instead of str for section and segment names?

rhelmot · 2023-11-08T20:37:00Z

My question for OP is: are your section names encoded in some other encoding, or are they garbage?

AlexVanMechelen · 2023-11-08T21:05:56Z

@rhelmot The section names are garbage. Some executable packers create such garbage section names, leading to the above error for all executables packed with them.

rhelmot · 2023-11-08T23:34:43Z

Does any compiler support generating utf-8 section names? If so, I would recommend adding the error-replace utf-8 decoding. If not, Latin-1.

ltfish · 2023-11-09T00:00:22Z

@rhelmot Some malware intentionally makes their section names garbage. I don't think we want to fail to load those binaries in such cases.

rhelmot · 2023-11-09T00:00:56Z

Neither of those solutions will fail with garbage bytes.

ltfish · 2023-11-09T05:55:01Z

Why don't we default to latin-1?

rhelmot · 2023-11-09T05:56:22Z

That's why I asked the question about whether compilers let you generate utf8 section names manually

AlexVanMechelen added bug needs-triage labels Nov 8, 2023

ltfish removed the needs-triage label Nov 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fails to load PE binaries with non-utf-8 decodable bytes in section name #438

Fails to load PE binaries with non-utf-8 decodable bytes in section name #438

AlexVanMechelen commented Nov 8, 2023 •

edited

Loading

ltfish commented Nov 8, 2023

rhelmot commented Nov 8, 2023

AlexVanMechelen commented Nov 8, 2023

rhelmot commented Nov 8, 2023

ltfish commented Nov 9, 2023

rhelmot commented Nov 9, 2023

ltfish commented Nov 9, 2023

rhelmot commented Nov 9, 2023

Fails to load PE binaries with non-utf-8 decodable bytes in section name #438

Fails to load PE binaries with non-utf-8 decodable bytes in section name #438

Comments

AlexVanMechelen commented Nov 8, 2023 • edited Loading

Description

ltfish commented Nov 8, 2023

rhelmot commented Nov 8, 2023

AlexVanMechelen commented Nov 8, 2023

rhelmot commented Nov 8, 2023

ltfish commented Nov 9, 2023

rhelmot commented Nov 9, 2023

ltfish commented Nov 9, 2023

rhelmot commented Nov 9, 2023

AlexVanMechelen commented Nov 8, 2023 •

edited

Loading