<?xml version="1.0" encoding="UTF-8"?>
<events xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="UCBCNEvents_1.9.xsd">

	
    <event>
        <id>3728353</id>
        <event_id>962290</event_id>
        <eventinstance_id>3728353</eventinstance_id>
		<calendar>
			<id>3519</id>
			<title>CS/CRCV Seminars</title>
			<slug>cscrcv-seminars</slug>
			<url>https://events.ucf.edu/calendar/3519/cscrcv-seminars/</url>
		</calendar>
		<title>Advancing Temporal Action Localization: Efficient Large Model Adaptation and Open-Vocabulary Recognition in Videos</title>
		<subtitle></subtitle>
		<start_date>Fri, 07 Feb 2025 14:00:00 -0500</start_date>
		<end_date>Fri, 07 Feb 2025 15:00:00 -0500</end_date>
		<location>Research 1</location>
		<room>101</room>
		<virtual_url><![CDATA[https://ucf.zoom.us/j/93193076532?pwd=lZcxqayQ8jhY6Fg2sPfJJfIXTrENC5.1&from=addon]]></virtual_url>

		<address>
			<building></building>
			<city></city>
			<zipcode></zipcode>
			<state></state>
			<country>
				<id xmlns="urn:oasis:names:specification:ubl:schema:xsd:CommonAggregateComponents-1.0" codeListID="ISO3166-1" codeListAgencyID="6" codeListAgencyName="United Nations Economic Commission for Europe" codeListName="country" codeListVersionID="0.3" languageID="en" codeListURI="http://www.iso.org/iso/en/prods-services/iso3166ma/02iso-3166-code-lists/list-en1-semic.txt" codeListSchemeURI="urn:oasis:names:specification:ubl:schema:xsd:countryIdentificationCode-1.0">US</id>
				<name xmlns="urn:oasis:names:specification:ubl:schema:xsd:CommonBasicComponents-1.0">United States</name>
			</country>
		</address>

		<location_info>
			<location_type></location_type>
			<location_hours></location_hours>
			<location_phone></location_phone>
			<location_website><![CDATA[https://www.google.com/maps/place/4353+Scorpius+St,+Orlando,+FL+32816/@28.6029518,-81.1997953,17z/data=!3m1!4b1!4m5!3m4!1s0x88e7685b5a0352d5:0xf2a8108e6e846bc1!8m2!3d28.6029518!4d-81.1976066]]></location_website>
			<directions></directions>
			<map_link></map_link>
			<additional_public_info></additional_public_info>
		</location_info>

		<languages>
			<language>en-US</language>
		</languages>
		<event_transparency></event_transparency>

		<description><![CDATA[<p><strong>Speaker</strong>: Ms. Akshita Gupta</p>
<p><strong>From</strong>: TU Darmstadt</p>
<p><strong>Abstract</strong></p>
<p>In this talk, I will discuss advances in Temporal Action Localization (TAL), focusing on two contributions: Efficient Large Model Adaptation and Open-Vocabulary Recognition in Videos.</p>
<p>The first part of the talk introduces the Long-Short-range Adapter (LoSA), a memory-efficient backbone adapter designed for untrimmed videos. LoSA modifies intermediate layers across various temporal ranges to enhance video features, enabling end-to-end adaptation of billion-parameter models like VideoMAEv2. This approach ensures efficient utilization of state-of-the-art video models, even with the complexities of untrimmed video data.</p>
<p>The second part of the talk explores the OVFormer framework, which addresses Open-Vocabulary TAL. OVFormer leverages a language model to generate rich class descriptions and aligns these descriptions with video features using cross-attention. The framework employs a two-stage training strategy to enable generalization to novel categories, extending the range of recognizable actions beyond predefined categories.</p>
<p>Additionally, I will briefly discuss my internship work at Apple, where I worked on generating speech from videos of people and their transcripts.</p>
<p>For more information, please follow this <a href="https://www.crcv.ucf.edu/wp-content/uploads/2018/11/Gupta-Flyer.pdf" target="_blank">link</a>.</p>]]></description>
		<short_description></short_description>

		<refreshments></refreshments>
		<webpages>
			<webpage>
				<title>Event Instance url</title>
				<url>https://events.ucf.edu/event/3728353/advancing-temporal-action-localization-efficient-large-model-adaptation-and-open-vocabulary-recognition-in-videos/</url>
			</webpage>
		</webpages>

		<contact_person>Cherry Place</contact_person>
		<contact_email>cherry@crcv.ucf.edu</contact_email>
		<contact_phone></contact_phone>

		<category>Speaker/Lecture/Seminar</category>
		<tags>
			<tag>UCFCRCV</tag>
		</tags>
		<registration_link></registration_link>
		<registration_info></registration_info>

		<status>Happening As Scheduled</status>
		<classification>Public</classification>
	</event>



</events>