[FFmpeg-devel] [PATCH v3 2/2] avformat: add data_size for ff_hex_to_data()

Andreas Rheinhardt andreas.rheinhardt at outlook.com
Tue May 11 04:45:51 EEST 2021


lance.lmwang at gmail.com:
> On Tue, May 11, 2021 at 01:27:09AM +0200, Andreas Rheinhardt wrote:
>> lance.lmwang at gmail.com:
>>> On Mon, May 10, 2021 at 09:52:48PM +0200, Andreas Rheinhardt wrote:
>>>> lance.lmwang at gmail.com:
>>>>> From: Limin Wang <lance.lmwang at gmail.com>
>>>>>
>>>>> This prevents OOM in case of data buffer size is insufficient.
>>>>
>>>> OOM?
>>> Yes, it's invalid Out Of Memory access, not no memory available. 
>>> What's your suggestion?
>>>
>>
>> What you mean is commonly called "buffer overflow"; OOM exclusively
>> means "no memory available".
>> I already told you what I think should be done below.
> 
> Sorry, I didn't notice that.
> 
>>
>>>>
>>>>>
>>>>> Signed-off-by: Limin Wang <lance.lmwang at gmail.com>
>>>>> ---
>>>>>  libavformat/hls.c          | 4 +++-
>>>>>  libavformat/internal.h     | 6 ++++--
>>>>>  libavformat/rtpdec_latm.c  | 6 ++++--
>>>>>  libavformat/rtpdec_mpeg4.c | 6 ++++--
>>>>>  libavformat/utils.c        | 7 +++++--
>>>>>  5 files changed, 20 insertions(+), 9 deletions(-)
>>>>>
>>>>> diff --git a/libavformat/hls.c b/libavformat/hls.c
>>>>> index 8fc6924..d7d0387 100644
>>>>> --- a/libavformat/hls.c
>>>>> +++ b/libavformat/hls.c
>>>>> @@ -800,7 +800,9 @@ static int parse_playlist(HLSContext *c, const char *url,
>>>>>              if (!strcmp(info.method, "SAMPLE-AES"))
>>>>>                  key_type = KEY_SAMPLE_AES;
>>>>>              if (!av_strncasecmp(info.iv, "0x", 2)) {
>>>>> -                ff_hex_to_data(iv, info.iv + 2);
>>>>> +                ret = ff_hex_to_data(iv, sizeof(iv), info.iv + 2);
>>>>> +                if (ret < 0)
>>>>> +                    goto fail;
>>>>>                  has_iv = 1;
>>>>>              }
>>>>>              av_strlcpy(key, info.uri, sizeof(key));
>>>>> diff --git a/libavformat/internal.h b/libavformat/internal.h
>>>>> index d57e63c..3192aca 100644
>>>>> --- a/libavformat/internal.h
>>>>> +++ b/libavformat/internal.h
>>>>> @@ -423,10 +423,12 @@ char *ff_data_to_hex(char *buf, const uint8_t *src, int size, int lowercase);
>>>>>   * digits is ignored.
>>>>>   *
>>>>>   * @param data if non-null, the parsed data is written to this pointer
>>>>> + * @param data_size the data buffer size
>>>>>   * @param p the string to parse
>>>>> - * @return the number of bytes written (or to be written, if data is null)
>>>>> + * @return the number of bytes written (or to be written, if data is null),
>>>>> + *  or < 0 in case data_size is insufficient if data isn't null.
>>>>>   */
>>>>> -int ff_hex_to_data(uint8_t *data, const char *p);
>>>>> +int ff_hex_to_data(uint8_t *data, int data_size, const char *p);
>>>>
>>>> This is unnecessary, as none of the callers need it: The rtpdec users
>>>> call ff_hex_to_data() twice, once to get the necessary size, once to
>>>> write the data. And the hls buffer is already big enough. I only see two
> 
> Yes, most of caller is call ff_hex_to_data() twice, but hls is using iv[16],
> We can't assume the string is 128bit only as cracker can change the m3u8 and
> make the buffer overflow. For the data may be array, so I prefer to add the
> memory overflow check. In theory, it's big enough, but we can't assume it.
> 

How could this happen? Even if the m3u8 is changed concurrently,
ff_parse_key_value() will always write a NUL-terminated string into
info.iv and said string won't change even if the m3u8 is changed.

> 
>>>> things that could be improved: Return size_t in ff_hex_to_data() as
>>>> that's the proper type (this includes checks in the callers for whether
>>>> the return type fit into the type of the extradata size)) and making the
>>>> size of the iv automatically match the needed size of (struct keyinfo).iv.
> 
> Maybe I think it's better to alloc in ff_hex_to_data function and return the
> buffer directly.
> 

This has several drawbacks: The user might know an upper bound of the
buffer size in advance, so that an allocation is unnecessary; the user
might not want a huge buffer at all (this happens with the rtp users:
they should reject lengths that don't fit into an int (anything that
comes close to that bound is probably bullshit anyway)); or the user has
special needs (this also happens with rtp: they need it as extradata,
i.e. it needs to be padded).

>>>>
>>>>>  
>>>>>  /**
>>>>>   * Add packet to an AVFormatContext's packet_buffer list, determining its
>>>>> diff --git a/libavformat/rtpdec_latm.c b/libavformat/rtpdec_latm.c
>>>>> index 104a00a..cf1d581 100644
>>>>> --- a/libavformat/rtpdec_latm.c
>>>>> +++ b/libavformat/rtpdec_latm.c
>>>>> @@ -91,7 +91,7 @@ static int latm_parse_packet(AVFormatContext *ctx, PayloadContext *data,
>>>>>  
>>>>>  static int parse_fmtp_config(AVStream *st, const char *value)
>>>>>  {
>>>>> -    int len = ff_hex_to_data(NULL, value), i, ret = 0;
>>>>> +    int len = ff_hex_to_data(NULL, 0, value), i, ret = 0;
>>>>>      GetBitContext gb;
>>>>>      uint8_t *config;
>>>>>      int audio_mux_version, same_time_framing, num_programs, num_layers;
>>>>> @@ -100,7 +100,9 @@ static int parse_fmtp_config(AVStream *st, const char *value)
>>>>>      config = av_mallocz(len + AV_INPUT_BUFFER_PADDING_SIZE);
>>>>>      if (!config)
>>>>>          return AVERROR(ENOMEM);
>>>>> -    ff_hex_to_data(config, value);
>>>>> +    ret = ff_hex_to_data(config, len, value);
>>>>> +    if (ret < 0)
>>>>> +        return ret;
>>>>>      init_get_bits(&gb, config, len*8);
>>>>>      audio_mux_version = get_bits(&gb, 1);
>>>>>      same_time_framing = get_bits(&gb, 1);
>>>>> diff --git a/libavformat/rtpdec_mpeg4.c b/libavformat/rtpdec_mpeg4.c
>>>>> index 34c7950..27751df 100644
>>>>> --- a/libavformat/rtpdec_mpeg4.c
>>>>> +++ b/libavformat/rtpdec_mpeg4.c
>>>>> @@ -112,11 +112,13 @@ static void close_context(PayloadContext *data)
>>>>>  static int parse_fmtp_config(AVCodecParameters *par, const char *value)
>>>>>  {
>>>>>      /* decode the hexa encoded parameter */
>>>>> -    int len = ff_hex_to_data(NULL, value), ret;
>>>>> +    int len = ff_hex_to_data(NULL, 0, value), ret;
>>>>>  
>>>>>      if ((ret = ff_alloc_extradata(par, len)) < 0)
>>>>>          return ret;
>>>>> -    ff_hex_to_data(par->extradata, value);
>>>>> +    ret = ff_hex_to_data(par->extradata, par->extradata_size, value);
>>>>> +    if (ret < 0)
>>>>> +        return ret;>      return 0;
>>>>>  }
>>>>>  
>>>>> diff --git a/libavformat/utils.c b/libavformat/utils.c
>>>>> index 9228313..dfe9f4c 100644
>>>>> --- a/libavformat/utils.c
>>>>> +++ b/libavformat/utils.c
>>>>> @@ -4768,7 +4768,7 @@ char *ff_data_to_hex(char *buff, const uint8_t *src, int s, int lowercase)
>>>>>      return buff;
>>>>>  }
>>>>>  
>>>>> -int ff_hex_to_data(uint8_t *data, const char *p)
>>>>> +int ff_hex_to_data(uint8_t *data, int data_size, const char *p)
>>>>>  {
>>>>>      int c, len, v;
>>>>>  
>>>>> @@ -4787,8 +4787,11 @@ int ff_hex_to_data(uint8_t *data, const char *p)
>>>>>              break;
>>>>>          v = (v << 4) | c;
>>>>>          if (v & 0x100) {
>>>>> -            if (data)
>>>>> +            if (data) {
>>>>> +                if (len >= data_size)
>>>>> +                    return AVERROR(ERANGE);
>>>>>                  data[len] = v;
>>>>> +            }
>>>>>              len++;
>>>>>              v = 1;
>>>>>          }
>>>>>
>>>>



More information about the ffmpeg-devel mailing list