NVDEC fails to decode on the RTSP client when FFmpeg pushes a local MP4 (H.264) file to an RTSP server

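Problem description:
I recently ran into a problem in audio/video work. The project requires NVIDIA NVDEC to decode the received RTSP stream. At first I pulled the stream from a camera and decoding worked fine. Later the camera was no longer available, so I pushed an RTSP stream with ffmpeg + ZLMediaKit and pulled that instead, and decoding suddenly stopped working.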

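Solution:
My first guess was that the pushed stream itself was broken, but VLC could open it.
My second guess was that the h264_mp4toannexb bitstream filter (bsf) conversion was wrong.
Reference: https://github.com/gongluck/AnalysisAVP#%E6%B5%81%E5%AA%92%E4%BD%93%E5%8D%8F%E8%AE%AE
I dumped the stream after the bsf conversion and it looked fine: the start codes were there. Surprisingly, though, there was no SPS or PPS in front of the keyframes. That is the problem. The fix is to insert the SPS and PPS in front of each keyframe before feeding it to the decoder.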

Someone has already posted a solution to this problem online:
"Decoding fails when H.264 and AAC parsed out of mp4/flv files are fed to the decoder"
https://www.cnblogs.com/lihaiping/p/5285166.html

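Then I got the camera back and did the same thing, inspecting the stream with an analysis tool.
The tool is very handy and can be found online.
It shows that the camera's H.264 stream carries its own SPS, PPS and SEI.

Pay special attention to the NAL header bytes: 67 (SPS), 68 (PPS), 06 (SEI), 61 (non-IDR slice).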
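
To make those byte values easier to read, here is a minimal sketch of how the one-byte NAL header splits into its fields. The names NalHeader, parseNalHeader and nalTypeName are made up for this illustration and are not part of FFmpeg.

```cpp
#include <cstdint>
#include <string>

// Fields of the one-byte H.264 NAL unit header.
struct NalHeader {
    int forbiddenZeroBit;  // bit 7, must be 0
    int nalRefIdc;         // bits 6..5
    int nalUnitType;       // bits 4..0
};

// Hypothetical helper: split a header byte into its three fields.
NalHeader parseNalHeader(uint8_t b) {
    return NalHeader{(b >> 7) & 0x1, (b >> 5) & 0x3, b & 0x1f};
}

// Human-readable name for the NAL unit types mentioned in this article.
std::string nalTypeName(int type) {
    switch (type) {
        case 1: return "non-IDR slice";
        case 5: return "IDR slice";
        case 6: return "SEI";
        case 7: return "SPS";
        case 8: return "PPS";
        default: return "other";
    }
}
```

For example, 0x67 is 0110 0111 in binary, so nal_ref_idc is 3 and nal_unit_type is 7 (SPS), while 0x06 has nal_ref_idc 0 and nal_unit_type 6 (SEI).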

But how do we add this information to the stream? As mentioned above, FFmpeg provides the bitstream filter (bsf) functions:

av_bsf_get_by_name("h264_mp4toannexb")
av_bsf_alloc()
av_bsf_init()
av_bsf_send_packet()
av_bsf_receive_packet()

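This set of functions runs a filter, h264_mp4toannexb_filter, which converts the data in the extradata of AVCodecParameters into SPS and PPS NAL units and prepends them to the keyframes.
Once they are in place, the NVIDIA decoder decodes normally.
Reference: https://www.cnblogs.com/nsnow/p/3862709.html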

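I had loaded the h264_mp4toannexb filter as well, so why were the SPS, PPS and SEI still missing? Digging further:
The pushed RTSP stream is already in Annex B form, so the bsf treats its input as already converted and passes packets through unchanged. The stream, however, does not carry SPS/PPS in front of its IDR frames, so they have to be inserted manually: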

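auto nbsfRet = av_bsf_receive_packet(m_bsfc, &m_pktFiltered);
// Right after it, prepend the parameter sets:
appendPPSandSPS(&m_pktFiltered,
                m_bsfc->par_in->extradata, m_bsfc->par_in->extradata_size);
// m_bsfc->par_in->extradata already holds the SPS and PPS in Annex B form:
// 00 00 00 01 67 64 00 32 AC 2C 6A 80 A0 02 D6 9B 80 80 80 A0 00 00 E1 00 00 2B F2 00 80 00 00 00 01 68 EE 3C B0

// Prepend the Annex B SPS/PPS (inf, len2) to the packet payload.
// Needs <cstring> and libavcodec's packet API (av_grow_packet).
// To touch keyframes only, check pkt->flags & AV_PKT_FLAG_KEY before calling.
bool appendPPSandSPS(AVPacket *pkt, const uint8_t *inf, int len2) {
  if (!pkt || !pkt->data || !inf || pkt->size <= 5 || len2 <= 5)
    return false;
  const uint8_t *src = pkt->data;
  // The packet must start with a 4-byte start code, and extradata must start
  // with a start code followed by an SPS (nal_unit_type 7, header byte 0x67).
  if (!(src[0] == 0 && src[1] == 0 && src[2] == 0 && src[3] == 1) ||
      !(inf[0] == 0 && inf[1] == 0 && inf[2] == 0 && inf[3] == 1 &&
        (inf[4] & 0x1f) == 7))
    return false;
  const int len1 = pkt->size;
  // Grow through the packet API: a bare realloc() on pkt->data would bypass
  // the AVBufferRef that owns the buffer, and the caller would never see the
  // new pointer.
  if (av_grow_packet(pkt, len2) < 0)
    return false;
  memmove(pkt->data + len2, pkt->data, len1);
  memcpy(pkt->data, inf, len2);
  return true;
}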
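
The same idea can be tried without any FFmpeg types. The following is a minimal sketch, assuming the frame and the parameter sets are both Annex B buffers with 4-byte start codes; Bytes, startsWithStartCode4 and prependParameterSets are names invented for this illustration.

```cpp
#include <cstdint>
#include <vector>

using Bytes = std::vector<uint8_t>;

// True if buf begins with the 4-byte Annex B start code 00 00 00 01
// and has at least one byte after it.
static bool startsWithStartCode4(const Bytes& buf) {
    return buf.size() > 4 && buf[0] == 0 && buf[1] == 0 && buf[2] == 0 && buf[3] == 1;
}

// Returns a copy of `frame` with `paramSets` (Annex B SPS + PPS) in front.
// The frame is returned unchanged if it is not a keyframe, if either buffer
// lacks a start code, or if paramSets does not begin with an SPS.
Bytes prependParameterSets(const Bytes& frame, const Bytes& paramSets, bool isKeyFrame) {
    if (!isKeyFrame || !startsWithStartCode4(frame) || !startsWithStartCode4(paramSets))
        return frame;
    if ((paramSets[4] & 0x1f) != 7)  // first NAL in paramSets must be an SPS
        return frame;
    Bytes out;
    out.reserve(paramSets.size() + frame.size());
    out.insert(out.end(), paramSets.begin(), paramSets.end());
    out.insert(out.end(), frame.begin(), frame.end());
    return out;
}
```

Returning a new buffer instead of growing the input in place keeps ownership simple, at the cost of one extra copy per keyframe.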

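A note here: m_bsfc->par_in is an AVCodecParameters holding the decoding parameters of the input stream, i.e. the codecpar of the stream obtained after avformat_open_input(). Its extradata holds the SPS and PPS. Besides extradata there is also side_data; see: https://blog.csdn.net/bolitongyue/article/details/109053503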

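When H.264 is transported over RTP, an SDP description is needed, and two of its items are the Sequence Parameter Set (SPS) and the Picture Parameter Set (PPS). Where do they come from? From the H.264 bitstream. In an H.264 bitstream every NAL unit begins with the start code "0x00 0x00 0x01" or "0x00 0x00 0x00 0x01". After locating a start code, take the low 5 bits of the first byte after it and check for 7 (SPS) or 8 (PPS); with a 4-byte start code that is (data[4] & 0x1f) == 7 || (data[4] & 0x1f) == 8 (the parentheses matter, because == binds tighter than & in C). Then strip the start code from the NAL unit and base64-encode it; the result can be used in the SDP. The SPS and PPS are separated by a comma.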

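The SPS and PPS strings in the SDP contain the parameters needed to initialize an H.264 decoder, including the profile and level used for encoding, the picture width and height, the deblocking filter settings, and so on.
The SDP normally contains the SPS and PPS, but sometimes it does not, and then they have to be taken from the stream.
Details: https://blog.csdn.net/Jody1989/article/details/46127561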
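
As a rough sketch of the start-code search described above, the following splits an Annex B buffer into NAL units and picks out the first SPS and PPS. It assumes the whole buffer is in memory; splitAnnexB, findParameterSets and ParameterSets are illustrative names, not FFmpeg APIs.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

using Bytes = std::vector<uint8_t>;

// Splits an Annex B buffer into NAL units with the start codes removed.
// Handles both 00 00 01 and 00 00 00 01 start codes.
std::vector<Bytes> splitAnnexB(const Bytes& data) {
    std::vector<Bytes> nals;
    const size_t n = data.size();
    size_t nalStart = n;  // n means "no NAL unit open yet"
    size_t i = 0;
    while (i + 3 <= n) {
        if (data[i] == 0 && data[i + 1] == 0 && data[i + 2] == 1) {
            if (nalStart != n) {
                size_t end = i;
                // Zeros right before 00 00 01 belong to the start code,
                // since a NAL unit never ends with a 0x00 byte.
                while (end > nalStart && data[end - 1] == 0) --end;
                if (end > nalStart)
                    nals.emplace_back(data.data() + nalStart, data.data() + end);
            }
            nalStart = i + 3;
            i += 3;
        } else {
            ++i;
        }
    }
    if (nalStart < n) nals.emplace_back(data.data() + nalStart, data.data() + n);
    return nals;
}

struct ParameterSets {
    Bytes sps;
    Bytes pps;
};

// Returns the first SPS (type 7) and the first PPS (type 8) in the buffer.
ParameterSets findParameterSets(const Bytes& annexB) {
    ParameterSets ps;
    for (const Bytes& nal : splitAnnexB(annexB)) {
        const int type = nal[0] & 0x1f;  // low 5 bits of the header byte
        if (type == 7 && ps.sps.empty()) ps.sps = nal;
        if (type == 8 && ps.pps.empty()) ps.pps = nal;
    }
    return ps;
}
```

The returned sps and pps have no start code, which is the form that gets base64-encoded for the SDP.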

A detailed explanation of SPS and PPS in an H.264 bitstream:
https://zhuanlan.zhihu.com/p/27896239

Here is an example SDP session:

v=0
o=- 0 0 IN IP4 192.168.1.112
s=Stream-0
i=N/A
c=IN IP4 192.168.1.104
t=0 0
a=recvonly
m=video 5006 RTP/AVP 96
a=rtpmap:96 H264/90000
a=fmtp:96 packetization-mode=1;profile-level-id=42c016;sprop-parameter-sets=Z0LAFqtAUB7QgAAAAwCAAAAPR4sXUA==,aM48gA==;
a=control:trackID=1
————————————————
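Copyright notice: this is an original article by CSDN blogger "Devil_Lee", licensed under CC 4.0 BY-SA. Please include the original source link and this notice when reposting.
Original link: https://blog.csdn.net/devil__lee/article/details/9717471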

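The main parameters here are profile-level-id and sprop-parameter-sets.

sprop-parameter-sets contains the SPS and PPS, separated by a comma.

Where do these values come from, and what do they mean?

See: http://www.cnblogs.com/skyseraph/archive/2012/04/01/2429384.html

After studying this for a while, I found that the value after sprop-parameter-sets is the SPS and the PPS taken from the bitstream, each base64-encoded.

profile-level-id is the hexadecimal string of the 3 bytes that follow the SPS header byte 0x67.

So once the SPS and PPS are available, the key information for the SDP can be derived.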
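
The two derivations can be sketched as follows. This is an illustration only: base64Encode, profileLevelId and spropParameterSets are names invented here, and the inputs are assumed to be SPS/PPS NAL units with their start codes already removed.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstdio>
#include <string>
#include <vector>

using Bytes = std::vector<uint8_t>;

// Standard base64 encoding (RFC 4648 alphabet, '=' padding).
std::string base64Encode(const Bytes& in) {
    static const char kTable[] =
        "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";
    std::string out;
    size_t i = 0;
    for (; i + 3 <= in.size(); i += 3) {
        const uint32_t v = (in[i] << 16) | (in[i + 1] << 8) | in[i + 2];
        out += kTable[(v >> 18) & 0x3f];
        out += kTable[(v >> 12) & 0x3f];
        out += kTable[(v >> 6) & 0x3f];
        out += kTable[v & 0x3f];
    }
    const size_t rest = in.size() - i;
    if (rest == 1) {
        const uint32_t v = in[i] << 16;
        out += kTable[(v >> 18) & 0x3f];
        out += kTable[(v >> 12) & 0x3f];
        out += "==";
    } else if (rest == 2) {
        const uint32_t v = (in[i] << 16) | (in[i + 1] << 8);
        out += kTable[(v >> 18) & 0x3f];
        out += kTable[(v >> 12) & 0x3f];
        out += kTable[(v >> 6) & 0x3f];
        out += '=';
    }
    return out;
}

// profile-level-id: the 3 bytes after the SPS header byte, as lowercase hex.
std::string profileLevelId(const Bytes& sps) {
    if (sps.size() < 4) return "";
    char buf[7];
    std::snprintf(buf, sizeof(buf), "%02x%02x%02x",
                  static_cast<unsigned>(sps[1]),
                  static_cast<unsigned>(sps[2]),
                  static_cast<unsigned>(sps[3]));
    return buf;
}

// sprop-parameter-sets: base64(SPS) "," base64(PPS).
std::string spropParameterSets(const Bytes& sps, const Bytes& pps) {
    return base64Encode(sps) + "," + base64Encode(pps);
}
```

Checked against the SDP above: the PPS bytes 68 CE 3C 80 encode to aM48gA==, and an SPS beginning 67 42 C0 16 gives the profile-level-id 42c016.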

When the stream is pulled with FFmpeg, FFmpeg parses the SPS and PPS and stores them in the extradata of the codecpar of streams[0] in the AVFormatContext.

If you are interested, read on for a look at the filter functions.
av_bsf_init(bsfcCtx) takes one very important parameter:
AVBSFContext *bsfcCtx

typedef struct AVBSFContext {
    /**
     * A class for logging and AVOptions
     */
    const AVClass *av_class;

    /**
     * The bitstream filter this context is an instance of.
     */
    const struct AVBitStreamFilter *filter;

    /**
     * Opaque libavcodec internal data. Must not be touched by the caller in any
     * way.
     */
    AVBSFInternal *internal;

    /**
     * Opaque filter-specific private data. If filter->priv_class is non-NULL,
     * this is an AVOptions-enabled struct.
     */
    void *priv_data; // for h264_mp4toannexb this is an H264BSFContext

    /**
     * Parameters of the input stream. This field is allocated in
     * av_bsf_alloc(), it needs to be filled by the caller before
     * av_bsf_init().
     */
    AVCodecParameters *par_in; // holds the AVCodecParameters of the input stream

    /**
     * Parameters of the output stream. This field is allocated in
     * av_bsf_alloc(), it is set by the filter in av_bsf_init().
     */
    AVCodecParameters *par_out;

    /**
     * The timebase used for the timestamps of the input packets. Set by the
     * caller before av_bsf_init().
     */
    AVRational time_base_in;

    /**
     * The timebase used for the timestamps of the output packets. Set by the
     * filter in av_bsf_init().
     */
    AVRational time_base_out;
} AVBSFContext;

Let's look at how it is created:

const AVBitStreamFilter *bsf;
bsf = av_bsf_get_by_name("h264_mp4toannexb");
if (!bsf) {
    LOG_INFO("h264_mp4toannexb bitstream filter get failed");
    return false;
}

const AVBitStreamFilter *av_bsf_get_by_name(const char *name)
{
    const AVBitStreamFilter *f = NULL;
    void *i = 0;

    if (!name)
        return NULL;
    // iterate to find the filter by name; how many filters are there?
    while ((f = av_bsf_iterate(&i))) { // each call returns the next bsf
        if (!strcmp(f->name, name))
            return f;
    }

    return NULL;
}
// FFmpeg has this many bitstream filters:
static const AVBitStreamFilter * const bitstream_filters[] = {
    &ff_aac_adtstoasc_bsf,
    &ff_av1_frame_split_bsf,
    &ff_av1_metadata_bsf,
    &ff_chomp_bsf,
    &ff_dump_extradata_bsf,
    &ff_dca_core_bsf,
    &ff_eac3_core_bsf,
    &ff_extract_extradata_bsf,
    &ff_filter_units_bsf,
    &ff_h264_metadata_bsf,
    &ff_h264_mp4toannexb_bsf,
    &ff_h264_redundant_pps_bsf,
    &ff_hapqa_extract_bsf,
    &ff_hevc_metadata_bsf,
    &ff_hevc_mp4toannexb_bsf,
    &ff_imx_dump_header_bsf,
    &ff_mjpeg2jpeg_bsf,
    &ff_mjpega_dump_header_bsf,
    &ff_mp3_header_decompress_bsf,
    &ff_mpeg2_metadata_bsf,
    &ff_mpeg4_unpack_bframes_bsf,
    &ff_mov2textsub_bsf,
    &ff_noise_bsf,
    &ff_null_bsf,
    &ff_prores_metadata_bsf,
    &ff_remove_extradata_bsf,
    &ff_text2movsub_bsf,
    &ff_trace_headers_bsf,
    &ff_truehd_core_bsf,
    &ff_vp9_metadata_bsf,
    &ff_vp9_raw_reorder_bsf,
    &ff_vp9_superframe_bsf,
    &ff_vp9_superframe_split_bsf,
    NULL };


auto ret=  av_bsf_alloc(bsf, &m_bsfc);
int av_bsf_alloc(const AVBitStreamFilter *filter, AVBSFContext **pctx)
{
    AVBSFContext *ctx;
    int ret;

    ctx = av_mallocz(sizeof(*ctx));
    if (!ctx)
        return AVERROR(ENOMEM);

    ctx->av_class = &bsf_class;
    // the filter is assigned here
    ctx->filter   = filter;

    ctx->par_in  = avcodec_parameters_alloc();
    ctx->par_out = avcodec_parameters_alloc();
    if (!ctx->par_in || !ctx->par_out) {
        ret = AVERROR(ENOMEM);
        goto fail;
    }

    ctx->internal = av_mallocz(sizeof(*ctx->internal));
    if (!ctx->internal) {
        ret = AVERROR(ENOMEM);
        goto fail;
    }

    ctx->internal->buffer_pkt = av_packet_alloc();
    if (!ctx->internal->buffer_pkt) {
        ret = AVERROR(ENOMEM);
        goto fail;
    }

    av_opt_set_defaults(ctx);

    /* allocate priv data and init private options */
    if (filter->priv_data_size) {
        // this usually stores option parameters and other private state
        ctx->priv_data = av_mallocz(filter->priv_data_size);
        if (!ctx->priv_data) {
            ret = AVERROR(ENOMEM);
            goto fail;
        }
        // store filter->priv_class as the first member of ctx->priv_data
        if (filter->priv_class) {
            *(const AVClass **)ctx->priv_data = filter->priv_class;
            av_opt_set_defaults(ctx->priv_data);
        }
    }

    *pctx = ctx;
    return 0;
fail:
    av_bsf_free(&ctx);
    return ret;
}

Next, par_in is filled in:

// the main work here is copying the extradata
avcodec_parameters_copy(m_bsfc->par_in, m_param.codecParams);
int avcodec_parameters_copy(AVCodecParameters *dst, const AVCodecParameters *src)
{
    codec_parameters_reset(dst);
    memcpy(dst, src, sizeof(*dst));

    dst->extradata      = NULL;
    dst->extradata_size = 0;
    if (src->extradata) {
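    // this is where the SPS and PPS are stored
    // FFmpeg parses this data from the SDP of the RTSP session; if the SDP
    // carries no SPS/PPS, the stream itself has to carry them
    // if the SDP does carry the SPS/PPS, the stream may omit them
    // 00 00 00 01 67 64 00 32 AC D9 40 28 00 B5 A6 C8 00 00 03 00 08 00 00 03 01 90 78 C1 8C B0
    // 00 00 00 01 68 EB E3 CB 22 C0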
        dst->extradata = av_mallocz(src->extradata_size + AV_INPUT_BUFFER_PADDING_SIZE);
        if (!dst->extradata)
            return AVERROR(ENOMEM);
        memcpy(dst->extradata, src->extradata, src->extradata_size);
        dst->extradata_size = src->extradata_size;
    }

    return 0;
}

Next comes av_bsf_init. In m_bsfc, the internal member is what holds the input packet.

av_bsf_init(m_bsfc);
// checks that the codec id (AV_CODEC_ID_H264) is supported by the filter, then fills in par_out
int av_bsf_init(AVBSFContext *ctx)
{
    int ret, i;

    /* check that the codec is supported */
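    // --- aside (not part of av_bsf_init): the filter definition whose codec_ids
    // are checked below; av_bsf_init resumes at "if (ctx->filter->codec_ids)" ---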
   static void h264_mp4toannexb_flush(AVBSFContext *ctx)
{
    H264BSFContext *s = ctx->priv_data;

    s->idr_sps_seen = 0;
    s->idr_pps_seen = 0;
    s->new_idr      = s->extradata_parsed;
}

static const enum AVCodecID codec_ids[] = {
    AV_CODEC_ID_H264, AV_CODEC_ID_NONE,
};

const AVBitStreamFilter ff_h264_mp4toannexb_bsf = {
    .name           = "h264_mp4toannexb",
    .priv_data_size = sizeof(H264BSFContext),
    .init           = h264_mp4toannexb_init,
    .filter         = h264_mp4toannexb_filter,
    .flush          = h264_mp4toannexb_flush,
    .codec_ids      = codec_ids,
};
  
    if (ctx->filter->codec_ids) {
    // check whether the input stream's codec_id is supported by the filter
        for (i = 0; ctx->filter->codec_ids[i] != AV_CODEC_ID_NONE; i++)
            if (ctx->par_in->codec_id == ctx->filter->codec_ids[i])
                break;
         // reaching the terminator means the codec is not supported
        if (ctx->filter->codec_ids[i] == AV_CODEC_ID_NONE) {
            const AVCodecDescriptor *desc = avcodec_descriptor_get(ctx->par_in->codec_id);
            av_log(ctx, AV_LOG_ERROR, "Codec '%s' (%d) is not supported by the "
                   "bitstream filter '%s'. Supported codecs are: ",
                   desc ? desc->name : "unknown", ctx->par_in->codec_id, ctx->filter->name);
            for (i = 0; ctx->filter->codec_ids[i] != AV_CODEC_ID_NONE; i++) {
                desc = avcodec_descriptor_get(ctx->filter->codec_ids[i]);
                av_log(ctx, AV_LOG_ERROR, "%s (%d) ",
                       desc ? desc->name : "unknown", ctx->filter->codec_ids[i]);
            }
            av_log(ctx, AV_LOG_ERROR, "\n");
            return AVERROR(EINVAL);
        }
    }

    /* initialize output parameters to be the same as input
     * init below might overwrite that */
    ret = avcodec_parameters_copy(ctx->par_out, ctx->par_in);
    if (ret < 0)
        return ret;

    ctx->time_base_out = ctx->time_base_in;

    if (ctx->filter->init) {
    //h264_mp4toannexb_init
        ret = ctx->filter->init(ctx);
        if (ret < 0)
            return ret;
    }

    return 0;
}

static int h264_mp4toannexb_init(AVBSFContext *ctx)
{
    H264BSFContext *s = ctx->priv_data;
    // size of the extradata (SPS, PPS, ...)
    int extra_size = ctx->par_in->extradata_size;
    int ret;

    /* retrieve sps and pps NAL units from extradata */
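    // in our case this path logs "The input looks like it is Annex B already",
    // i.e. the input extradata is already Annex B
    // --- aside: AV_RB24 / AV_RB32 read big-endian 24-bit / 32-bit values ---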
#ifndef AV_RB24
#   define AV_RB24(x)                           \
    ((((const uint8_t*)(x))[0] << 16) |         \
     (((const uint8_t*)(x))[1] <<  8) |         \
      ((const uint8_t*)(x))[2])
#endif

#ifndef AV_RB32
#   define AV_RB32(x)                                \
    (((uint32_t)((const uint8_t*)(x))[0] << 24) |    \
               (((const uint8_t*)(x))[1] << 16) |    \
               (((const uint8_t*)(x))[2] <<  8) |    \
                ((const uint8_t*)(x))[3])
#endif
    
    // (end of aside) What is AV_RB24 doing here? It checks whether the
    // extradata begins with one of the Annex B start codes:
    // 0x00 00 01
    // 0x00 00 00 01
    if (!extra_size                                               ||
        (extra_size >= 3 && AV_RB24(ctx->par_in->extradata) == 1) ||
        (extra_size >= 4 && AV_RB32(ctx->par_in->extradata) == 1)) {
        av_log(ctx, AV_LOG_VERBOSE,
               "The input looks like it is Annex B already\n");
    } else if (extra_size >= 6) {
        ret = h264_extradata_to_annexb(ctx, AV_INPUT_BUFFER_PADDING_SIZE);
        if (ret < 0)
            return ret;

        s->length_size      = ret;
        s->new_idr          = 1;
        s->idr_sps_seen     = 0;
        s->idr_pps_seen     = 0;
        s->extradata_parsed = 1;
    } else {
        av_log(ctx, AV_LOG_ERROR, "Invalid extradata size: %d\n", extra_size);
        return AVERROR_INVALIDDATA;
    }

    return 0;
}
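
The branch that matters for this bug is the first one. Below is a minimal sketch of the same classification outside FFmpeg; readBE24, readBE32, ExtradataKind and classifyExtradata are names made up for this illustration, with the two read helpers standing in for AV_RB24 and AV_RB32.

```cpp
#include <cstddef>
#include <cstdint>

// Big-endian reads, equivalent in effect to FFmpeg's AV_RB24 / AV_RB32.
static uint32_t readBE24(const uint8_t* p) {
    return (uint32_t(p[0]) << 16) | (uint32_t(p[1]) << 8) | p[2];
}
static uint32_t readBE32(const uint8_t* p) {
    return (uint32_t(p[0]) << 24) | (uint32_t(p[1]) << 16) |
           (uint32_t(p[2]) << 8) | p[3];
}

enum class ExtradataKind { Empty, AnnexB, Avcc, Invalid };

// Follows the branch order of h264_mp4toannexb_init shown above.
ExtradataKind classifyExtradata(const uint8_t* data, size_t size) {
    if (size == 0) return ExtradataKind::Empty;   // nothing to convert
    if ((size >= 3 && readBE24(data) == 1) ||
        (size >= 4 && readBE32(data) == 1))
        return ExtradataKind::AnnexB;             // filter becomes a pass-through
    if (size >= 6) return ExtradataKind::Avcc;    // handled by h264_extradata_to_annexb
    return ExtradataKind::Invalid;
}
```

With the extradata from the RTSP session (00 00 00 01 67 ...), this returns AnnexB, which corresponds to the case where extradata_parsed is never set and the filter inserts nothing.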

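Now let's look at h264_extradata_to_annexb.
Broadly, an H.264 bitstream can be packaged in two ways. One is the Annex B byte stream format, the default output of most encoders, in which each NAL unit is preceded by a 3- or 4-byte H.264 start code, 0x000001 or 0x00000001.
The other is the length-prefixed NAL format (AVCC), in which the first few bytes (1, 2 or 4) give the length of the NAL unit instead of a start code. In that case the decoder needs some global data to obtain the encoder's profile, level, PPS, SPS and so on before it can decode.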
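
To make the difference concrete, here is a minimal sketch that rewrites one length-prefixed packet as Annex B. It only swaps length fields for start codes and does not insert SPS/PPS; avccToAnnexB is a name invented for this illustration, and treating a zero-length NAL unit as an error is a choice made here, not something taken from FFmpeg.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

using Bytes = std::vector<uint8_t>;

// Converts one length-prefixed (AVCC) packet to Annex B.
// lengthSize is the size of each length field in bytes (1, 2 or 4).
// Returns false and leaves `out` empty if the input is malformed.
bool avccToAnnexB(const Bytes& in, int lengthSize, Bytes& out) {
    out.clear();
    if (lengthSize != 1 && lengthSize != 2 && lengthSize != 4) return false;
    static const uint8_t kStartCode[4] = {0, 0, 0, 1};
    size_t pos = 0;
    while (pos < in.size()) {
        if (in.size() - pos < static_cast<size_t>(lengthSize)) {
            out.clear();
            return false;
        }
        // Read the big-endian NAL length.
        size_t nalSize = 0;
        for (int i = 0; i < lengthSize; ++i)
            nalSize = (nalSize << 8) | in[pos + i];
        pos += lengthSize;
        if (nalSize == 0 || nalSize > in.size() - pos) {
            out.clear();
            return false;
        }
        // Emit a start code followed by the NAL payload.
        out.insert(out.end(), kStartCode, kStartCode + 4);
        out.insert(out.end(), in.data() + pos, in.data() + pos + nalSize);
        pos += nalSize;
    }
    return true;
}
```

The length size is not stored in the packets themselves; it comes from the global extradata, which is why the filter has to parse the extradata first.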

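I had long wondered why some videos report their format as H264 while most report AVC1.
I found the answer on Microsoft's MSDN while searching for programming material:
Source: http://msdn.microsoft.com/en-us/library/dd757808(v=vs.85).aspx
FOURCC: AVC1  Description: H.264 bitstream without start codes.
FOURCC: H264  Description: H.264 bitstream with start codes.
An introduction to the H.264 avc1 format in MP4 files:
https://blog.csdn.net/haima1998/article/details/50426944/

The following article explains in detail what h264_extradata_to_annexb does:
"Extracting H.264 NALUs from MP4 with FFmpeg"
https://blog.csdn.net/gavinr/article/details/7183499

"H.264: the MP4 format, and extracting the H.264 SPS, PPS and bitstream from an MP4 file"
https://www.cnblogs.com/skyseraph/archive/2012/04/01/2429384.html

"Extracting H.264 and AAC from MP4 with FFmpeg"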
https://jiangdg.blog.csdn.net/article/details/102665541

static int h264_extradata_to_annexb(AVBSFContext *ctx, const int padding)
{
    H264BSFContext *s = ctx->priv_data;
    // --- aside: the definition of H264BSFContext ---
    typedef struct H264BSFContext {
    int32_t  sps_offset;
    int32_t  pps_offset;
    uint8_t  length_size;
    uint8_t  new_idr;
    uint8_t  idr_sps_seen;
    uint8_t  idr_pps_seen;
    int      extradata_parsed;
} H264BSFContext;
    // --- end of aside; h264_extradata_to_annexb continues ---
    uint16_t unit_size;
    uint64_t total_size                 = 0;
    uint8_t *out                        = NULL, unit_nb, sps_done = 0,
             sps_seen                   = 0, pps_seen = 0;
    const uint8_t *extradata            = ctx->par_in->extradata + 4;
    static const uint8_t nalu_header[4] = { 0, 0, 0, 1 };
    // mask 0x3 = 0000 0011: the low 2 bits of this byte are lengthSizeMinusOne,
    // so adding 1 gives the size of the NAL length field
    int length_size = (*extradata++ & 0x3) + 1; // retrieve length coded size

    s->sps_offset = s->pps_offset = -1;

    /* retrieve sps and pps unit(s) */
    // mask 0x1f = 0001 1111: the low 5 bits of this byte are the number of SPS units
    unit_nb = *extradata++ & 0x1f; /* number of sps unit(s) */
    if (!unit_nb) {
        goto pps;
    } else {
        s->sps_offset = 0;
        sps_seen = 1;
    }

    while (unit_nb--) {
        int err;

        unit_size   = AV_RB16(extradata);
        total_size += unit_size + 4;
        if (total_size > INT_MAX - padding) {
            av_log(ctx, AV_LOG_ERROR,
                   "Too big extradata size, corrupted stream or invalid MP4/AVCC bitstream\n");
            av_free(out);
            return AVERROR(EINVAL);
        }
        if (extradata + 2 + unit_size > ctx->par_in->extradata + ctx->par_in->extradata_size) {
            av_log(ctx, AV_LOG_ERROR, "Packet header is not contained in global extradata, "
                   "corrupted stream or invalid MP4/AVCC bitstream\n");
            av_free(out);
            return AVERROR(EINVAL);
        }
        if ((err = av_reallocp(&out, total_size + padding)) < 0)
            return err;
        memcpy(out + total_size - unit_size - 4, nalu_header, 4);
        memcpy(out + total_size - unit_size, extradata + 2, unit_size);
        extradata += 2 + unit_size;
pps:
        if (!unit_nb && !sps_done++) {
            unit_nb = *extradata++; /* number of pps unit(s) */
            if (unit_nb) {
                s->pps_offset = total_size;
                pps_seen = 1;
            }
        }
    }

    if (out)
        memset(out + total_size, 0, padding);

    if (!sps_seen)
        av_log(ctx, AV_LOG_WARNING,
               "Warning: SPS NALU missing or invalid. "
               "The resulting stream may not play.\n");

    if (!pps_seen)
        av_log(ctx, AV_LOG_WARNING,
               "Warning: PPS NALU missing or invalid. "
               "The resulting stream may not play.\n");

    av_freep(&ctx->par_out->extradata);
    ctx->par_out->extradata      = out;
    ctx->par_out->extradata_size = total_size;

    return length_size;
}
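
Stripped of the FFmpeg bookkeeping, the parsing above comes down to walking the avcC record. The following is a simplified sketch under that reading; AvccConfig, copyUnits and parseAvcc are illustrative names, and the offsets FFmpeg tracks (sps_offset, pps_offset) and the padding are left out.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

using Bytes = std::vector<uint8_t>;

struct AvccConfig {
    int lengthSize = 0;  // bytes per NAL length field in the packets
    Bytes annexB;        // SPS and PPS, each preceded by 00 00 00 01
};

// Appends `count` units, each stored as a 2-byte big-endian length plus data.
static bool copyUnits(const Bytes& in, size_t& pos, int count, Bytes& out) {
    const uint8_t startCode[4] = {0, 0, 0, 1};
    for (int i = 0; i < count; ++i) {
        if (in.size() - pos < 2) return false;
        const size_t size = (size_t(in[pos]) << 8) | in[pos + 1];
        pos += 2;
        if (size > in.size() - pos) return false;
        out.insert(out.end(), startCode, startCode + 4);
        out.insert(out.end(), in.data() + pos, in.data() + pos + size);
        pos += size;
    }
    return true;
}

// Parses an avcC record; returns false if it is truncated.
bool parseAvcc(const Bytes& avcc, AvccConfig& cfg) {
    if (avcc.size() < 6) return false;
    cfg.lengthSize = (avcc[4] & 0x3) + 1;   // lengthSizeMinusOne + 1
    cfg.annexB.clear();
    size_t pos = 5;
    const int numSps = avcc[pos++] & 0x1f;  // low 5 bits: number of SPS units
    if (!copyUnits(avcc, pos, numSps, cfg.annexB)) return false;
    if (pos >= avcc.size()) return false;
    const int numPps = avcc[pos++];         // whole byte: number of PPS units
    return copyUnits(avcc, pos, numPps, cfg.annexB);
}
```

cfg.annexB then plays the role of par_out->extradata, and cfg.lengthSize is what the packet filter uses to read each NAL length.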

Then comes the processing loop:

auto ret = av_bsf_send_packet(m_bsfc, inPacket);
// this function mainly moves pkt into ctx->internal->buffer_pkt
int av_bsf_send_packet(AVBSFContext *ctx, AVPacket *pkt)
{
    int ret;

    if (!pkt || (!pkt->data && !pkt->side_data_elems)) {
        ctx->internal->eof = 1;
        return 0;
    }

    if (ctx->internal->eof) {
        av_log(ctx, AV_LOG_ERROR, "A non-NULL packet sent after an EOF.\n");
        return AVERROR(EINVAL);
    }

    if (ctx->internal->buffer_pkt->data ||
        ctx->internal->buffer_pkt->side_data_elems)
        return AVERROR(EAGAIN);

    ret = av_packet_make_refcounted(pkt);
    if (ret < 0)
        return ret;
    av_packet_move_ref(ctx->internal->buffer_pkt, pkt);

    return 0;
}


auto nbsfRet = av_bsf_receive_packet(m_bsfc, &m_pktFiltered);
int av_bsf_receive_packet(AVBSFContext *ctx, AVPacket *pkt)
{
    return ctx->filter->filter(ctx, pkt);
}

static int h264_mp4toannexb_filter(AVBSFContext *ctx, AVPacket *out)
{
    H264BSFContext *s = ctx->priv_data;

    AVPacket *in;
    uint8_t unit_type;
    int32_t nal_size;
    uint32_t cumul_size    = 0;
    const uint8_t *buf;
    const uint8_t *buf_end;
    int            buf_size;
    int ret = 0, i;
    // fetch the buffered packet
    ret = ff_bsf_get_packet(ctx, &in);
    if (ret < 0)
        return ret;
    // --- aside: ff_bsf_get_packet ---
int ff_bsf_get_packet(AVBSFContext *ctx, AVPacket **pkt)
{
    AVBSFInternal *in = ctx->internal;
    AVPacket *tmp_pkt;

    if (in->eof)
        return AVERROR_EOF;

    if (!ctx->internal->buffer_pkt->data &&
        !ctx->internal->buffer_pkt->side_data_elems)
        return AVERROR(EAGAIN);

    tmp_pkt = av_packet_alloc();
    if (!tmp_pkt)
        return AVERROR(ENOMEM);

    *pkt = ctx->internal->buffer_pkt;
    ctx->internal->buffer_pkt = tmp_pkt;

    return 0;
}
    // --- end of aside; h264_mp4toannexb_filter continues ---

    /* nothing to filter */
    // recall the "already Annex B" case above: extradata_parsed stays 0,
    // so the packet is returned unchanged here
    if (!s->extradata_parsed) {
        av_packet_move_ref(out, in);
        av_packet_free(&in);
        return 0;
    }

    buf      = in->data;
    buf_size = in->size;
    buf_end  = in->data + in->size;

    do {
        ret= AVERROR(EINVAL);
        if (buf + s->length_size > buf_end)
            goto fail;

        for (nal_size = 0, i = 0; i<s->length_size; i++)
            nal_size = (nal_size << 8) | buf[i];

        buf += s->length_size;
        unit_type = *buf & 0x1f;

        if (nal_size > buf_end - buf || nal_size < 0)
            goto fail;

        if (unit_type == H264_NAL_SPS)
            s->idr_sps_seen = s->new_idr = 1;
        else if (unit_type == H264_NAL_PPS) {
            s->idr_pps_seen = s->new_idr = 1;
            /* if SPS has not been seen yet, prepend the AVCC one to PPS */
            if (!s->idr_sps_seen) {
                if (s->sps_offset == -1)
                    av_log(ctx, AV_LOG_WARNING, "SPS not present in the stream, nor in AVCC, stream may be unreadable\n");
                else {
                    if ((ret = alloc_and_copy(out,
                                         ctx->par_out->extradata + s->sps_offset,
                                         s->pps_offset != -1 ? s->pps_offset : ctx->par_out->extradata_size - s->sps_offset,
                                         buf, nal_size, 1)) < 0)
                        goto fail;
                    s->idr_sps_seen = 1;
                    goto next_nal;
                }
            }
        }

        /* if this is a new IDR picture following an IDR picture, reset the idr flag.
         * Just check first_mb_in_slice to be 0 as this is the simplest solution.
         * This could be checking idr_pic_id instead, but would complexify the parsing. */
        if (!s->new_idr && unit_type == H264_NAL_IDR_SLICE && (buf[1] & 0x80))
            s->new_idr = 1;

        /* prepend only to the first type 5 NAL unit of an IDR picture, if no sps/pps are already present */
        if (s->new_idr && unit_type == H264_NAL_IDR_SLICE && !s->idr_sps_seen && !s->idr_pps_seen) {
        // when a keyframe is copied, the SPS and PPS are copied in front of it
            if ((ret=alloc_and_copy(out,
                               ctx->par_out->extradata, ctx->par_out->extradata_size,
                               buf, nal_size, 1)) < 0)
                goto fail;
            s->new_idr = 0;
        /* if only SPS has been seen, also insert PPS */
        } else if (s->new_idr && unit_type == H264_NAL_IDR_SLICE && s->idr_sps_seen && !s->idr_pps_seen) {
            if (s->pps_offset == -1) {
                av_log(ctx, AV_LOG_WARNING, "PPS not present in the stream, nor in AVCC, stream may be unreadable\n");
                if ((ret = alloc_and_copy(out, NULL, 0, buf, nal_size, 0)) < 0)
                    goto fail;
            } else if ((ret = alloc_and_copy(out,
                                        ctx->par_out->extradata + s->pps_offset, ctx->par_out->extradata_size - s->pps_offset,
                                        buf, nal_size, 1)) < 0)
                goto fail;
        } else {
            if ((ret=alloc_and_copy(out, NULL, 0, buf, nal_size, unit_type == H264_NAL_SPS || unit_type == H264_NAL_PPS)) < 0)
                goto fail;
            if (!s->new_idr && unit_type == H264_NAL_SLICE) {
                s->new_idr = 1;
                s->idr_sps_seen = 0;
                s->idr_pps_seen = 0;
            }
        }

next_nal:
        buf        += nal_size;
        cumul_size += nal_size + s->length_size;
    } while (cumul_size < buf_size);

    ret = av_packet_copy_props(out, in);
    if (ret < 0)
        goto fail;

fail:
    if (ret < 0)
        av_packet_unref(out);
    av_packet_free(&in);

    return ret;
}

typedef struct AVCodecParameters {
    /**
     * General type of the encoded data.
     */
    enum AVMediaType codec_type;
    /**
     * Specific type of the encoded data (the codec used).
     */
    enum AVCodecID   codec_id;
    /**
     * Additional information about the codec (corresponds to the AVI FOURCC).
     */
    uint32_t         codec_tag;

    /**
     * Extra binary data needed for initializing the decoder, codec-dependent.
     *
     * Must be allocated with av_malloc() and will be freed by
     * avcodec_parameters_free(). The allocated size of extradata must be at
     * least extradata_size + AV_INPUT_BUFFER_PADDING_SIZE, with the padding
     * bytes zeroed.
     * Example: 00 00 00 01 67 64 00 32 AC D9 40 28 00 B5 A6 C8 00 00 03 00 08 00 00 03 01 90 78 C1 8C B0 00 00 00 01 68 EB E3 CB 22 C0
     */
    uint8_t *extradata; // this is where the SPS and PPS are stored
    /**
     * Size of the extradata content in bytes.
     */
    int      extradata_size;

    /**
     * - video: the pixel format, the value corresponds to enum AVPixelFormat.
     * - audio: the sample format, the value corresponds to enum AVSampleFormat.
     */
    int format;

    /**
     * The average bitrate of the encoded data (in bits per second).
     */
    int64_t bit_rate;

    /**
     * The number of bits per sample in the codedwords.
     *
     * This is basically the bitrate per sample. It is mandatory for a bunch of
     * formats to actually decode them. It's the number of bits for one sample in
     * the actual coded bitstream.
     *
     * This could be for example 4 for ADPCM
     * For PCM formats this matches bits_per_raw_sample
     * Can be 0
     */
    int bits_per_coded_sample;

    /**
     * This is the number of valid bits in each output sample. If the
     * sample format has more bits, the least significant bits are additional
     * padding bits, which are always 0. Use right shifts to reduce the sample
     * to its actual size. For example, audio formats with 24 bit samples will
     * have bits_per_raw_sample set to 24, and format set to AV_SAMPLE_FMT_S32.
     * To get the original sample use "(int32_t)sample >> 8"."
     *
     * For ADPCM this might be 12 or 16 or similar
     * Can be 0
     */
    int bits_per_raw_sample;

    /**
     * Codec-specific bitstream restrictions that the stream conforms to.
     */
    int profile;
    int level;

    /**
     * Video only. The dimensions of the video frame in pixels.
     */
    int width;
    int height;

    /**
     * Video only. The aspect ratio (width / height) which a single pixel
     * should have when displayed.
     *
     * When the aspect ratio is unknown / undefined, the numerator should be
     * set to 0 (the denominator may have any value).
     */
    AVRational sample_aspect_ratio;

    /**
     * Video only. The order of the fields in interlaced video.
     */
    enum AVFieldOrder                  field_order;

    /**
     * Video only. Additional colorspace characteristics.
     */
    enum AVColorRange                  color_range;
    enum AVColorPrimaries              color_primaries;
    enum AVColorTransferCharacteristic color_trc;
    enum AVColorSpace                  color_space;
    enum AVChromaLocation              chroma_location;

    /**
     * Video only. Number of delayed frames.
     */
    int video_delay;

    /**
     * Audio only. The channel layout bitmask. May be 0 if the channel layout is
     * unknown or unspecified, otherwise the number of bits set must be equal to
     * the channels field.
     */
    uint64_t channel_layout;
    /**
     * Audio only. The number of audio channels.
     */
    int      channels;
    /**
     * Audio only. The number of audio samples per second.
     */
    int      sample_rate;
    /**
     * Audio only. The number of bytes per coded audio frame, required by some
     * formats.
     *
     * Corresponds to nBlockAlign in WAVEFORMATEX.
     */
    int      block_align;
    /**
     * Audio only. Audio frame size, if known. Required by some formats to be static.
     */
    int      frame_size;

    /**
     * Audio only. The amount of padding (in samples) inserted by the encoder at
     * the beginning of the audio. I.e. this number of leading decoded samples
     * must be discarded by the caller to get the original audio without leading
     * padding.
     */
    int initial_padding;
    /**
     * Audio only. The amount of padding (in samples) appended by the encoder to
     * the end of the audio. I.e. this number of decoded samples must be
     * discarded by the caller from the end of the stream to get the original
     * audio without any trailing padding.
     */
    int trailing_padding;
    /**
     * Audio only. Number of samples to skip after a discontinuity.
     */
    int seek_preroll;
} AVCodecParameters;


ffmpeg：单张图片 + 音频生成视频 KAMILLE ffmpeg
ffmpeg-r1-fimage2-loop1-i图片地址-i音频地址-s1920x1080-pix_fmtyuvj420p-t时长(秒)-vcodeclibx264视频地址帧率为1，转换速度更快。如果想根据音频的时长：ffmpeg-y-loop1-r1-i图片地址-i2.音频地址-vcodeclibx264-acodecaac-shortest视频地址ffmpeg-y-loop1-r1-i图片地
在C++程序中给视频添加文字水印 ygwelcome c++音视频开发语言 ffmpeg
有时候，我们需要给视频添加文字或水印，用已有的工具当然最简单，但想在自己的应用中，如C++应用程序中来实现，如何实现呢？假设采用FFmpeg库，可通过C++二次开发调用实现。当然这个过程还是比较复杂的，需要有一定的多媒体编程能力并使用FFmpeg的多媒体处理功能。可按以下步骤：1、安装FFmpeg：首先，确保你的系统上已经安装了FFmpeg。你可以从FFmpeg官方网站下载二进制文件，或者使用包管
C#中用ffmpeg截取视频使用要点两仪风 ffmpeg 音视频
一、代码stringinputFile="E:\\Test\\1\\5.mp4";stringoutputFile="E:\\Test\\1\\10.mp4";intstartTime=5;//开始时间（秒）intendtime=10;//结束时间（秒）Processp=newProcess();p.StartInfo.FileName=".\\ffmpeg\\ffmpeg.exe";//ffmp
使用python+ffmpeg把一个大视频切片成多个小视频，批量处理多个大视频的切片 mj412828668 Python ffmpeg python 音视频开发语言
#encoding=utf-8importosimportitertoolsdefmain():#使用前，要先配置好ffmpeg的环境变量，并删除videos_path中txt文件夹下的所有文件ffmpeg_path="D:\\FFmpeg\\bin\\ffmpeg"videos_path="C:\\Users\\Yan\\Desktop\\videos"concat_list_path=vide
python-使用ffmpeg批量修改文件的后缀名 Lulifer。批量改名
importosimportsubprocessdefconvert_ogg_to_mp3(directory):forfilenameinos.listdir(directory):iffilename.endswith(".ogg"):#获取文件的完整路径file_path=os.path.join(directory,filename)#创建一个新的文件名，只是将扩展名从.ogg更改为.mp
RK3588平台开发系列讲解（视频篇）ffmpeg 的移植内核笔记 RK3588 Android12 开发入门到精通专栏 RK3588
文章目录一、ffmpeg介绍二、ffmpeg的组成三、ffmpeg依赖库沉淀、分享、成长，让自己和他人都能有所收获！ffmpeg是一种多媒体音视频处理工具，具备视频采集功能、视频抓取图像、视频格式转换、给视频加水印并能将视频转化为流等诸多强大的功能。它采用LGPL或GPL许可证，是一种开源程序。一、ffmpeg介绍FFmpeg主要特点和功能：多媒体格式支持：FFmpeg支持几乎所有常见的音视频格式
python工具方法 45 基于ffmpeg以面向对象多线程的方式实现实时推流万里鹏程转瞬至 python工具方法 python ffmpeg 开发语言
1、视频推流参考基于ffmpeg模拟监控摄像头输出rtsp视频流并opencv播放实现视频流的推流。其基本操作就是，安装视频流推流服务器，ffmpeg，准备好要推流的视频。命令如下所示：ffmpeg-re-stream_loop-1-i风景视频素材分享.flv-ccopy-frtsprtsp://127.0.0.1:554/input其中风景视频素材分享.flv为文件名称，rtsp://127.0
Linux/CentOS安装ZLMediaKit流媒体服务 linuxcentos
一、centoslinux下安装ffmpeg1、下载解压wgethttp://www.ffmpeg.org/releases/ffmpeg-3.1.tar.gztar-zxvfffmpeg-3.1.tar.gz2、进入解压后目录,输入如下命令/usr/local/ffmpeg为自己指定的安装目录cdffmpeg-3.1./configure--prefix=/usr/local/ffmpegmak
Java调用FFmpeg将视频和音频合并成新视频的示例 javalinux
importjava.io.BufferedReader;importjava.io.IOException;importjava.io.InputStreamReader;importjava.util.ArrayList;importjava.util.List;publicclassFFmpegUtil{/***合并视频和音频*@paramvideoPath视频文件路径*@paramaudi
Android ffmpeg入门（1）—— 使用NDK交叉编译ffmpeg集成到Android项目孙先森i ANdroid NDK 开发学习 android 音视频 ffmpeg android ndk ndk
使用NDK交叉编译ffmpeg前言必备基础准备工作编写ffmpeg编译脚本Android项目集成新建项目导入ffmpeg集成测试前言最近在学习androidNDK开发相关内容，借ffmpeg练练手。ffmpeg是做音视频方面功能的基础，后面会随着个人的学习更新一系列ffmpeg博客，防止自己遗忘。这个系列博客主要目的是基于ffmpeg通过NDK开发的方式完成一个基本的视频播放器。本篇博客主要实现了
FFMPEG（一）华为云linux下编译ffmpeg for Android 冉航--小虾米 ffmpge android linux 华为
一、下载1.下载NDK1.1创建好目录结构华为云Ubuntulinux默认进来是在root根目录下，我们使用mkdirandroid命令创建一个android文件夹，然后cdandroid进入android文件夹下，再mkdirNDK创建一个目录，最终下载存放ndk的目录是/root/android/NDK。1.2下载NDK进入/root/android/NDK目录,使用如下命令下载NDK:wge
FFmpeg编程录制音频（Mac OS）老张音视频开发进阶 ffmpeg 音视频
之前我们使用FFmpeg命令行工具进行了简单的音视频操作，这次在MacOS环境下编写代码实现简单的音频录制功能。FFmpeg命令行音频录制首先回顾一下MacOS环境下简单的音频录制命令行实现：ffmpeg-favfoundation-i":0"-t20-acodecpcm_s16le-ar44100-ac2~/Desktop/output.wav参数说明：•-favfoundation：指定输入设
ffmpeg for android编译全过程与遇到的问题老张音视频开发进阶 ffmpeg android
编译前准备编译环境：Ubuntu16，可自行下载VMWare最新版并百度永久许可证或在服务器上安装Ubuntuffmpeg源码：ffmpeg4.2.2NDK下载：AndroidNDKr21e有条件的最好还是在Liunx平台下编译吧，Windows平台下编译坑更多，文章末尾有Github源码可自取开始编译1.解压NDK，执行unzipandroid-ndk-r21e-liunx-x86_64.zip
linux下安装ffmpeg的详细教程服务器linux
一、centos下安装ffmpeg1、下载解压wgethttp://www.ffmpeg.org/releases/ffmpeg-5.1.tar.gztar-zxvfffmpeg-4.1.tar.gz2、进入解压后目录,输入如下命令/usr/local/ffmpeg为自己指定的安装目录cdffmpeg-5.1./configure--prefix=/usr/local/ffmpegmake&&ma
使用openai-whisper实现语音转文字 MasonYyp whisper
使用openai-whisper实现语音转文字1安装依赖1.1Windows下安装ffmpegFFmpeg是一套可以用来记录、转换数字音频、视频，并能将其转化为流的开源计算机程序。采用LGPL或GPL许可证。它提供了录制、转换以及流化音视频的完整解决方案。#ffmpeg官网https://ffmpeg.org/#ffmpeg下载地址https://ffmpeg.org/download.html#
基于python使用ffmpeg打包exe后更换电脑操作(之一不添加环境变量而使用) 疯狂的豆包 python python
Python中使用ffmpeg，是借助它的音视频处理功能，但当打包exe后，只把exe文件放到其他电脑上，并不能运行。解决方式是将在官网上下载的ffmpeg所有文件夹与exe文件放在同一文件夹下。但你会发现仍然不能使用，故这里介绍两种方式：第一种方式，当使用的电脑可以设置环境变量时，按如下操作：在电脑中找到“高级系统设置”并点击“环境变量”，在变量“Path”下设置变量(ffmpeg所在的路径)第
Qt实用技巧：QCustomPlot做北斗GPS显示绝对位置运动轨迹和相对位置运动轨迹图的时，使图按照输入点顺序连曲线长沙红胖子Qt软件开发北斗轨迹图 GPS轨迹图绝对位置相对位置轨迹图 Qt
若该文为原创文章，转载请注明原文出处本文章博客地址：https://hpzwl.blog.csdn.net/article/details/136131310红胖子网络科技博文大全：开发技术集合（包含Qt实用技术、树莓派、三维、OpenCV、OpenGL、ffmpeg、OSG、单片机、软硬结合等等）持续更新中…Qt开发专栏：实用技巧需求使用QCustomPlot绘制多个目标的北斗运行轨迹图，包
使用 shell 脚本下载 ffmpeg并解压（mac下）属七降九
ffmpeg-download.sh脚本代码#!/bin/bash#库名称source="ffmpeg-3.4"#下载这个库if[!-r$source]then#没有下载，那么我需要执行下载操作echo"没有FFmpeg库，我们需要下载….."#下载：怎么下载？#"curl"命令表示：它可以通过Http\ftp等等这样的网络方式下载和上传文件（它是一个强大网络工具）#基本格式：curl地址#指定下
矩阵求逆（JAVA）利用伴随矩阵 qiuwanchi 利用伴随矩阵求逆矩阵
package gaodai.matrix; import gaodai.determinant.DeterminantCalculation; import java.util.ArrayList; import java.util.List; import java.util.Scanner; /** * 矩阵求逆(利用伴随矩阵) * @author 邱万迟
单例（Singleton）模式 aoyouzi 单例模式 Singleton
3.1 概述如果要保证系统里一个类最多只能存在一个实例时，我们就需要单例模式。这种情况在我们应用中经常碰到，例如缓存池，数据库连接池，线程池，一些应用服务实例等。在多线程环境中，为了保证实例的唯一性其实并不简单，这章将和读者一起探讨如何实现单例模式。 3.2
[开源与自主研发]就算可以轻易获得外部技术支持,自己也必须研发 comsci 开源
现在国内有大量的信息技术产品，都是通过盗版，免费下载，开源，附送等方式从国外的开发者那里获得的。。。。。。虽然这种情况带来了国内信息产业的短暂繁荣，也促进了电子商务和互联网产业的快速发展，但是实际上，我们应该清醒的看到，这些产业的核心力量是被国外的
页面有两个frame,怎样点击一个的链接改变另一个的内容 Array_06 UI XHTML
<a src="地址" targets="这里写你要操作的Frame的名字" />搜索然后你点击连接以后你的新页面就会显示在你设置的Frame名字的框那里 targerts="",就是你要填写目标的显示页面位置 ===================== 例如： <frame src=&
Struts2实现单个/多个文件上传和下载 oloz 文件上传 struts
struts2单文件上传：步骤01:jsp页面  　　<form action="fileUplo
推荐10个在线logo设计网站 362217990 logo
在线设计Logo网站。 1、http://flickr.nosv.org（这个太简单） 2、http://www.logomaker.com/?source=1.5770.1 3、http://www.simwebsol.com/ImageTool 4、http://www.logogenerator.com/logo.php?nal=1&tpl_catlist[]=2 5、ht
jsp上传文件香水浓 jsp fileupload
1. jsp上传 Notice： 1. form表单 method 属性必须设置为 POST 方法，不能使用 GET 方法 2. form表单 enctype 属性需要设置为 multipart/form-data 3. form表单 action 属性需要设置为提交到后台处理文件上传的jsp文件地址或者servlet地址。例如 uploadFile.jsp 程序文件用来处理上传的文
我的架构经验系列文章 - 前端架构 agevs JavaScript Web 框架 UI jQuer
框架层面：近几年前端发展很快，前端之所以叫前端因为前端是已经可以独立成为一种职业了，js也不再是十年前的玩具了，以前富客户端RIA的应用可能会用flash/flex或是silverlight，现在可以使用js来完成大部分的功能，因此js作为一门前端的支撑语言也不仅仅是进行的简单的编码，越来越多框架性的东西出现了。越来越多的开发模式转变为后端只是吐json的数据源，而前端做所有UI的事情。MVCMV
android ksoap2 中把XML(DataSet) 当做参数传递 aijuans android
我的android app中需要发送webservice ，于是我使用了 ksop2 进行发送，在测试过程中不是很顺利,不能正常工作.我的web service 请求格式如下 [html] view plain copy <Envelope xmlns="http://schemas.
使用Spring进行统一日志管理 + 统一异常管理 baalwolf spring
统一日志和异常管理配置好后，SSH项目中，代码以往散落的log.info() 和 try..catch..finally 再也不见踪影！统一日志异常实现类： [java] view plain copy package com.pilelot.web.util; impor
Android SDK 国内镜像 BigBird2012 android sdk
一、镜像地址： 1、东软信息学院的 Android SDK 镜像，比配置代理下载快多了。配置地址， http://mirrors.neusoft.edu.cn/configurations.we#android 2、北京化工大学的： IPV4:ubuntu.buct.edu.cn IPV4:ubuntu.buct.cn IPV6:ubuntu.buct6.edu.cn
HTML无害化和Sanitize模块 bijian1013 JavaScript AngularJS Linky Sanitize
一.ng-bind-html、ng-bind-html-unsafe AngularJS非常注重安全方面的问题，它会尽一切可能把大多数攻击手段最小化。其中一个攻击手段是向你的web页面里注入不安全的HTML，然后利用它触发跨站攻击或者注入攻击。考虑这样一个例子，假设我们有一个变量存
[Maven学习笔记二]Maven命令 bit1129 maven
mvn compile compile编译命令将src/main/java和src/main/resources中的代码和配置文件编译到target/classes中，不会对src/test/java中的测试类进行编译 MVN编译使用 maven-resources-plugin:2.6:resources maven-compiler-plugin:2.5.1:compile &nbs
【Java命令二】jhat bit1129 Java命令
jhat用于分析使用jmap dump的文件，，可以将堆中的对象以html的形式显示出来，包括对象的数量，大小等等，并支持对象查询语言。 jhat默认开启监听端口7000的HTTP服务，jhat是Java Heap Analysis Tool的缩写 1. 用法： [hadoop@hadoop bin]$ jhat -help Usage: jhat [-stack <bool&g
JBoss 5.1.0 GA:Error installing to Instantiated: name=AttachmentStore state=Desc ronin47
进到类似目录 server/default/conf/bootstrap，打开文件 profile.xml找到： Xml代码<bean name="AttachmentStore" class="org.jboss.system.server.profileservice.repository.AbstractAtta
写给初学者的6条网页设计安全配色指南 brotherlamp UI ui自学 ui视频 ui教程 ui资料
网页设计中最基本的原则之一是，不管你花多长时间创造一个华丽的设计，其最终的角色都是这场秀中真正的明星——内容的衬托我仍然清楚地记得我最早的一次美术课，那时我还是一个小小的、对凡事都充满渴望的孩子，我摆放出一大堆漂亮的彩色颜料。我仍然记得当我第一次看到原色与另一种颜色混合变成第二种颜色时的那种兴奋，并且我想，既然两种颜色能创造出一种全新的美丽色彩，那所有颜色
有一个数组，每次从中间随机取一个，然后放回去，当所有的元素都被取过，返回总共的取的次数。写一个函数实现。复杂度是什么。 bylijinnan java 算法面试
import java.util.Random; import java.util.Set; import java.util.TreeSet; /** * http://weibo.com/1915548291/z7HtOF4sx * #面试题#有一个数组，每次从中间随机取一个，然后放回去，当所有的元素都被取过，返回总共的取的次数。 * 写一个函数实现。复杂度是什么
struts2获得request、session、application方式 chiangfai application
1、与Servlet API解耦的访问方式。 a.Struts2对HttpServletRequest、HttpSession、ServletContext进行了封装，构造了三个Map对象来替代这三种对象要获取这三个Map对象，使用ActionContext类。 -----> package pro.action; import java.util.Map; imp
改变python的默认语言设置 chenchao051 python
import sys sys.getdefaultencoding() 可以测试出默认语言，要改变的话，需要在python lib的site-packages文件夹下新建： sitecustomize.py，这个文件比较特殊，会在python启动时来加载，所以就可以在里面写上： import sys sys.setdefaultencoding('utf-8') &n
mysql导入数据load data infile用法 daizj mysql 导入数据
我们常常导入数据！mysql有一个高效导入方法，那就是load data infile 下面来看案例说明基本语法： load data [low_priority] [local] infile 'file_name txt' [replace | ignore] into table tbl_name [fields [terminated by't'] [OPTI
phpexcel导入excel表到数据库简单入门示例 dcj3sjt126com PHP Excel
跟导出相对应的，同一个数据表，也是将phpexcel类放在class目录下，将Excel表格中的内容读取出来放到数据库中 <?php error_reporting(E_ALL); set_time_limit(0); ?> <html> <head> <meta http-equiv="Content-Type"
22岁到72岁的男人对女人的要求 dcj3sjt126com
22岁男人对女人的要求是：一，美丽，二，性感，三，有份具品味的职业，四，极有耐性，善解人意，五，该聪明的时候聪明，六，作小鸟依人状时尽量自然，七，怎样穿都好看，八，懂得适当地撒娇，九，虽作惊喜反应，但看起来自然，十，上了床就是个无条件荡妇。 32岁的男人对女人的要求，略作修定，是：一，入得厨房，进得睡房，二，不必服侍皇太后，三，不介意浪漫蜡烛配盒饭，四，听多过说，五，不再傻笑，六，懂得独
Spring和HIbernate对DDM设计的支持 e200702084 DAO 设计模式 spring Hibernate 领域模型
A：数据访问对象 DAO和资源库在领域驱动设计中都很重要。DAO是关系型数据库和应用之间的契约。它封装了Web应用中的数据库CRUD操作细节。另一方面，资源库是一个独立的抽象，它与DAO进行交互，并提供到领域模型的“业务接口”。资源库使用领域的通用语言，处理所有必要的DAO，并使用领域理解的语言提供对领域模型的数据访问服务。
NoSql 数据库的特性比较 geeksun NoSQL
Redis 是一个开源的使用ANSI C语言编写、支持网络、可基于内存亦可持久化的日志型、Key-Value数据库，并提供多种语言的API。目前由VMware主持开发工作。 1. 数据模型作为Key-value型数据库，Redis也提供了键（Key）和值（Value）的映射关系。除了常规的数值或字符串，Redis的键值还可以是以下形式之一： Lists （列表） Sets
使用 Nginx Upload Module 实现上传文件功能 hongtoushizi nginx
转载自： http://www.tuicool.com/wx/aUrAzm 普通网站在实现文件上传功能的时候，一般是使用Python，Java等后端程序实现，比较麻烦。Nginx有一个Upload模块，可以非常简单的实现文件上传功能。此模块的原理是先把用户上传的文件保存到临时文件，然后在交由后台页面处理，并且把文件的原名，上传后的名称，文件类型，文件大小set到页面。下
spring-boot-web-ui及thymeleaf基本使用 jishiweili spring thymeleaf
视图控制层代码demo如下： @Controller @RequestMapping("/") public class MessageController { private final MessageRepository messageRepository; @Autowired public MessageController(Mes
数据源架构模式之活动记录 home198979 PHP 架构活动记录数据映射
hello!架构一、概念活动记录（Active Record）：一个对象，它包装数据库表或视图中某一行，封装数据库访问，并在这些数据上增加了领域逻辑。对象既有数据又有行为。活动记录使用直截了当的方法，把数据访问逻辑置于领域对象中。二、实现简单活动记录活动记录在php许多框架中都有应用，如cakephp。 <?php /** * 行数据入口类 *
Linux Shell脚本之自动修改IP pda158 linux centos Debian 脚本
作为一名 Linux SA，日常运维中很多地方都会用到脚本，而服务器的ip一般采用静态ip或者MAC绑定，当然后者比较操作起来相对繁琐，而前者我们可以设置主机名、ip信息、网关等配置。修改成特定的主机名在维护和管理方面也比较方便。如下脚本用途为：修改ip和主机名等相关信息，可以根据实际需求修改，举一反三！ #!/bin/sh #auto Change ip netmask ga
开发环境搭建独浮云 eclipse jdk tomcat
最近在开发过程中，经常出现MyEclipse内存溢出等错误，需要重启的情况，好麻烦。对于一般的JAVA+TOMCAT项目开发，其实没有必要使用重量级的MyEclipse，使用eclipse就足够了。尤其是开发机器硬件配置一般的人。 &n