Audio and Video Basics: H.264 Topic (12) - Implementation of Video Resolution Calculation from SPS Attributes in the FFmpeg Source Code


2024-07-12

I. Introduction

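The previous article, "Audio and Video Basics: H.264 Topic (11) - Video Resolution Calculation Formula", described the formula for computing the resolution of an H.264-encoded video from the attributes in its SPS. This article walks through how the FFmpeg source code implements that calculation.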

II. Implementation of Video Resolution Calculation in the FFmpeg Source Code

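As covered in the article "Audio and Video Basics: H.264 Topic (10) - Analysis of the Structure That Stores SPS Attributes and the SPS Decoding Function in the FFmpeg Source Code", FFmpeg decodes the SPS with the ff_h264_decode_seq_parameter_set function to obtain the SPS attributes.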

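ff_h264_decode_seq_parameter_set contains the following code. This part reads the attributes needed to compute the video resolution: the picture size in macroblocks (mb_width, mb_height), frame_mbs_only_flag, and the four cropping offsets.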


int ff_h264_decode_seq_parameter_set(GetBitContext *gb, AVCodecContext *avctx,
                                     H264ParamSets *ps, int ignore_truncation)
{
    //...
    
    sps->gaps_in_frame_num_allowed_flag = get_bits1(gb);
    sps->mb_width                       = get_ue_golomb(gb) + 1;
    sps->mb_height                      = get_ue_golomb(gb) + 1;
 
    sps->frame_mbs_only_flag = get_bits1(gb);
 
    if (sps->mb_height >= INT_MAX / 2U) {
        av_log(avctx, AV_LOG_ERROR, "height overflow\n");
        goto fail;
    }
    sps->mb_height *= 2 - sps->frame_mbs_only_flag;
 
    //...
 
    sps->crop = get_bits1(gb);
    if (sps->crop) {
        unsigned int crop_left   = get_ue_golomb(gb);
        unsigned int crop_right  = get_ue_golomb(gb);
        unsigned int crop_top    = get_ue_golomb(gb);
        unsigned int crop_bottom = get_ue_golomb(gb);
        int width  = 16 * sps->mb_width;
        int height = 16 * sps->mb_height;
 
        if (avctx->flags2 & AV_CODEC_FLAG2_IGNORE_CROP) {
            av_log(avctx, AV_LOG_DEBUG, "discarding sps cropping, original "
                                           "values are l:%d r:%d t:%d b:%d\n",
                   crop_left, crop_right, crop_top, crop_bottom);
 
            sps->crop_left   =
            sps->crop_right  =
            sps->crop_top    =
            sps->crop_bottom = 0;
        } else {
            int vsub   = (sps->chroma_format_idc == 1) ? 1 : 0;
            int hsub   = (sps->chroma_format_idc == 1 ||
                          sps->chroma_format_idc == 2) ? 1 : 0;
            int step_x = 1 << hsub;
            int step_y = (2 - sps->frame_mbs_only_flag) << vsub;
 
            if (crop_left  > (unsigned)INT_MAX / 4 / step_x ||
                crop_right > (unsigned)INT_MAX / 4 / step_x ||
                crop_top   > (unsigned)INT_MAX / 4 / step_y ||
                crop_bottom> (unsigned)INT_MAX / 4 / step_y ||
                (crop_left + crop_right ) * step_x >= width ||
                (crop_top  + crop_bottom) * step_y >= height
            ) {
                av_log(avctx, AV_LOG_ERROR, "crop values invalid %d %d %d %d / %d %d\n",
                       crop_left, crop_right, crop_top, crop_bottom, width, height);
                goto fail;
            }
 
            sps->crop_left   = crop_left   * step_x;
            sps->crop_right  = crop_right  * step_x;
            sps->crop_top    = crop_top    * step_y;
            sps->crop_bottom = crop_bottom * step_y;
        }
    } else {
        sps->crop_left   =
        sps->crop_right  =
        sps->crop_top    =
        sps->crop_bottom =
        sps->crop        = 0;
    }
 
    //...
}
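The cropping branch above converts the raw SPS crop offsets into luma pixels by scaling them with step_x and step_y. The following is a minimal standalone sketch of that step, not FFmpeg code: the RawCrop and PixelCrop types and the crop_to_pixels function are hypothetical names introduced only for illustration, while the arithmetic and range checks mirror the excerpt.

```c
#include <assert.h>
#include <limits.h>

/* Hypothetical input type: raw values as read from the SPS bitstream. */
typedef struct {
    int      chroma_format_idc;   /* 0: monochrome, 1: 4:2:0, 2: 4:2:2, 3: 4:4:4 */
    int      frame_mbs_only_flag; /* 1: frame macroblocks only */
    unsigned crop_left, crop_right, crop_top, crop_bottom; /* in crop units */
} RawCrop;

/* Hypothetical output type: crop offsets expressed in luma pixels. */
typedef struct {
    unsigned left, right, top, bottom;
} PixelCrop;

/* Scales raw crop offsets to luma pixels for a coded picture of
 * width x height luma pixels. Returns 0 on success, -1 if the
 * offsets are out of range (same conditions as the excerpt). */
static int crop_to_pixels(const RawCrop *in, int width, int height,
                          PixelCrop *out)
{
    assert(in && out && width > 0 && height > 0);

    /* Chroma is subsampled vertically only for 4:2:0,
     * horizontally for 4:2:0 and 4:2:2. */
    int vsub = (in->chroma_format_idc == 1) ? 1 : 0;
    int hsub = (in->chroma_format_idc == 1 ||
                in->chroma_format_idc == 2) ? 1 : 0;
    unsigned step_x = 1u << hsub;
    unsigned step_y = (unsigned)(2 - in->frame_mbs_only_flag) << vsub;
    const unsigned limit = (unsigned)INT_MAX / 4;

    if (in->crop_left   > limit / step_x ||
        in->crop_right  > limit / step_x ||
        in->crop_top    > limit / step_y ||
        in->crop_bottom > limit / step_y ||
        (in->crop_left + in->crop_right ) * step_x >= (unsigned)width ||
        (in->crop_top  + in->crop_bottom) * step_y >= (unsigned)height)
        return -1;

    out->left   = in->crop_left   * step_x;
    out->right  = in->crop_right  * step_x;
    out->top    = in->crop_top    * step_y;
    out->bottom = in->crop_bottom * step_y;
    return 0;
}
```

For a progressive 4:2:0 stream (chroma_format_idc = 1, frame_mbs_only_flag = 1), step_y is 2, so a raw crop_bottom of 4 becomes 8 luma rows. With frame_mbs_only_flag = 0, step_y is 4, so a raw crop_bottom of 2 yields the same 8 rows.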

Next, the parse_nal_units function in the FFmpeg source file libavcodec/h264_parser.c contains the following code.


static inline int parse_nal_units(AVCodecParserContext *s,
                                  AVCodecContext *avctx,
                                  const uint8_t * const buf, int buf_size)
{
    //...
    
    for (;;) {
        switch (nal.type) {
        case H264_NAL_SPS:
            ff_h264_decode_seq_parameter_set(&nal.gb, avctx, &p->ps, 0);
            break;
         
        //...
 
        case H264_NAL_IDR_SLICE:
        
        //...
 
        s->coded_width  = 16 * sps->mb_width;
        s->coded_height = 16 * sps->mb_height;
        s->width        = s->coded_width  - (sps->crop_right + sps->crop_left);
        s->height       = s->coded_height - (sps->crop_top   + sps->crop_bottom);
        if (s->width <= 0 || s->height <= 0) {
            s->width  = s->coded_width;
            s->height = s->coded_height;
        }
        //... 
        }
        //...
    }
}
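The parser-side arithmetic in this excerpt can be tried in isolation. The sketch below is an illustration under stated assumptions rather than FFmpeg code: ExampleSps, ExampleSize, and resolution_from_sps are hypothetical names, the crop fields are assumed to be already scaled to luma pixels (as the SPS decoder stores them), and plain int is used throughout for simplicity.

```c
#include <assert.h>

/* Hypothetical subset of the decoded SPS used by the parser. */
typedef struct {
    int mb_width;   /* picture width in macroblocks */
    int mb_height;  /* picture height in macroblocks, already doubled
                       when frame_mbs_only_flag is 0 */
    int crop_left, crop_right, crop_top, crop_bottom; /* luma pixels */
} ExampleSps;

/* Hypothetical result type: coded size and cropped display size. */
typedef struct {
    int coded_width, coded_height;
    int width, height;
} ExampleSize;

/* Mirrors the statements in the excerpt: coded size is a multiple of
 * the 16x16 macroblock, and the display size subtracts the cropping. */
static ExampleSize resolution_from_sps(const ExampleSps *sps)
{
    ExampleSize s;

    assert(sps && sps->mb_width > 0 && sps->mb_height > 0);

    s.coded_width  = 16 * sps->mb_width;
    s.coded_height = 16 * sps->mb_height;
    s.width        = s.coded_width  - (sps->crop_right + sps->crop_left);
    s.height       = s.coded_height - (sps->crop_top   + sps->crop_bottom);

    /* Fall back to the coded size if cropping leaves nothing. */
    if (s.width <= 0 || s.height <= 0) {
        s.width  = s.coded_width;
        s.height = s.coded_height;
    }
    return s;
}
```

As a worked case, a typical 1080p stream has mb_width = 120 and mb_height = 68, which gives a coded size of 1920x1088. With crop_bottom = 8 luma rows and the other offsets at 0, the result is 1920x1080.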

In parse_nal_units, the following statements produce the final video resolution:


s->width = s->coded_width - (sps->crop_right + sps->crop_left);
s->height = s->coded_height - (sps->crop_top + sps->crop_bottom);

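The resolution calculation implemented in the FFmpeg source code is therefore consistent with the formula described in the article "Audio and Video Basics: H.264 Topic (11) - Video Resolution Calculation Formula".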
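To make the comparison concrete, the two excerpts can be folded into a single pair of expressions. This is a derivation from the code shown in this article only; the symbols below are chosen here for illustration, and whether article (11) uses the same notation is an assumption. Let W_mb and H_map be the two values read with get_ue_golomb plus 1, f be frame_mbs_only_flag, and c_l, c_r, c_t, c_b be the raw crop offsets. hsub is 1 when chroma_format_idc is 1 or 2, vsub is 1 when chroma_format_idc is 1, and both are 0 otherwise.

```latex
\begin{aligned}
\text{width}  &= 16\,W_{mb} \;-\; 2^{\text{hsub}}\,(c_l + c_r) \\
\text{height} &= 16\,(2 - f)\,H_{map} \;-\; (2 - f)\,2^{\text{vsub}}\,(c_t + c_b)
\end{aligned}
```

Two cases in the code fall outside these expressions: when AV_CODEC_FLAG2_IGNORE_CROP is set the crop terms are zeroed, and when the cropped width or height is not positive the parser falls back to the coded size.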
